亼? 二? Ordered lists & CJK ideographs
Sinoglot is getting another facelift. More on that later.
One of the things that we’re going to great pains to ensure is cross-everything compatibility. Unless you use Opera. More on that later too.
Part of this cross-browser, cross-system, cross-whatever-else compatibility is making sure everything is HTML5- and CSS3-compliant. This in turn has had me poring over standards references to find the goodies that will make it all work regardless of the device the person reading the posts (you) is using.
W3C, in a reference dated November 2002 and redone in 2009, provides a few nice ways to number ordered lists. These include “cjk-ideographic” (一 二 三 四 五…), as well as “japanese-formal,” “-informal” and a few other names which end up being “壹 貳 參 肆 伍 陸 柒 捌 玖…”. There’s also cjk-earthly-branch (子 丑 寅 卯 辰 巳 午 未 申 酉 戌 亥) and cjk-heavenly-stem (甲 乙 丙 丁 戊 己 庚 辛 壬 癸), which are nice to have.
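To make the idea concrete, here is a rough Python sketch of how a fixed symbol sequence like the stems or branches could be turned into list markers. The `marker` function is my own illustration, not anything from the W3C reference, and the fallback to plain decimal numbers once the symbols run out is purely an assumption; what a real browser does may differ.

```python
# Symbol sequences copied from the list styles named above.
HEAVENLY_STEMS = "甲乙丙丁戊己庚辛壬癸"
EARTHLY_BRANCHES = "子丑寅卯辰巳午未申酉戌亥"


def marker(symbols: str, n: int) -> str:
    """Return the marker for the n-th list item (1-based).

    Assumption: items past the end of the sequence fall back to
    plain decimal digits. This is a hypothetical choice for the sketch.
    """
    if 1 <= n <= len(symbols):
        return symbols[n - 1]
    return str(n)


if __name__ == "__main__":
    # Number a three-item list with heavenly stems.
    for i, text in enumerate(["first", "second", "third"], start=1):
        print(marker(HEAVENLY_STEMS, i), text)
```

With ten stems and twelve branches, anything longer than a short list runs off the end quickly, which is why the sketch needs some kind of fallback at all.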
The one I don’t quite get is called “trad-chinese-formal”. It follows “cjk-ideographic” more or less, but then 二 is replaced by 亼. What the hell is 亼? Here are the code points W3C lists for the simplified variant of the set:
Digit 1 一 U+4E00
Digit 2 亼 U+4EBC
Digit 3 三 U+4E09
Digit 4 四 U+56DB
Digit 5 五 U+4E94
Digit 6 六 U+516D
Digit 7 七 U+4E03
Digit 8 八 U+516B
Digit 9 九 U+4E5D
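A quick way to double-check a table like this is to ask Python for each character's code point. This is only a sanity check on the characters as they appear here; the `code_point` helper is my own name for illustration, not anything from W3C.

```python
def code_point(ch: str) -> str:
    """Format a single character's Unicode code point as U+XXXX."""
    return f"U+{ord(ch):04X}"


# Characters for digits 1-9 exactly as listed in the table above.
listed = "一亼三四五六七八九"

if __name__ == "__main__":
    for digit, ch in enumerate(listed, start=1):
        print(f"Digit {digit} {ch} {code_point(ch)}")

    # The character one would expect for digit 2, for comparison.
    print(f"Expected 2 二 {code_point('二')}")
```

For what it's worth, 二 comes out as U+4E8C, which is one hex digit away from the U+4EBC in the table.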
The character 亼, pronounced jí and given by Unihan as “to assemble, to gather together,” has no apparent business being there in place of 二. The 康熙字典 (Kangxi Dictionary) entry is below:
【子集中】【人字部】 亼； 康熙笔画：3； 页码：页91第03