From erik@sran8.sra.co.jp Mon Nov 19 07:56:01 1990
Received: from MCSUN.EU.NET by dkuug.dk via EUnet with SMTP (5.64+/8+bit/IDA-1.2.8)
	id AA17682; Mon, 19 Nov 90 07:56:01 +0100
Received: by mcsun.EU.net with SMTP; Mon, 19 Nov 90 07:59:21 +0100
Received: from srava.sra.co.jp by srawgw.sra.co.jp (5.64WH/1.4)
	id AA23718; Mon, 19 Nov 90 15:59:05 +0900
Received: from sran8.sra.co.jp by srava.sra.co.jp (5.64b/6.4J.6-BJW)
	id AA20156; Mon, 19 Nov 90 15:59:07 +0900
Received: from localhost by sran8.sra.co.jp (4.0/6.4J.6-SJ)
	id AA25789; Mon, 19 Nov 90 15:57:42 JST
Return-Path: <erik@sran8.sra.co.jp>
Message-Id: <9011190657.AA25789@sran8.sra.co.jp>
Reply-To: erik@sra.co.jp
From: Erik M. van der Poel <erik@sra.co.jp>
To: keld@dkuug.dk
Cc: arnet@hpcupt1.cup.hp.com, arnet@hpda.cup.hp.com, i18n@dkuug.dk,
        unicode@sun.com
Subject: Re: Han Character Code Ordering
Date: Mon, 19 Nov 90 15:57:40 +0900
Sender: erik@sran8.sra.co.jp
X-Charset: ASCII
X-Char-Esc: 29

> > As far as Unicode and collation are
> > concerned, UniHan is probably the way to go. ISO 10646 is somewhat at
> > a disadvantage in this respect.
> 
> I would like to have this spelled out:
> For Unicode UniHan is OK, that is ordering by most common pronounciation
> for the single Han character is OK?

Actually, if you look carefully at Joe's message, you will notice that
the UniHan scheme orders the characters by radical/stroke only:

# The layout of the Unicode "UniHan" collection is:
#     Level 1: radical/stroke
#     Level 2: radical/stroke
#     Level 3: radical/stroke
#     ....

I think that UniHan is the way to go for Unicode because it is simple
and uniform. You cannot do this for pronunciations because the
pronunciations differ from country to country. (Of course, the
radical/stroke count differs between mainland China (simplified) and
Taiwan (traditional), but I won't go into that here. Ain't my problem.
:-)


> Can this ordering also be used for 10646 and X0208 and X0212?

Well, you can't change the ordering of 0208 because that's carved in
stone. 0212 seems to be largely based on radical/stroke already.

But for 10646, we may still be able to send some ideas to put some or
all of the Han characters (in the basic multilingual plane and in each
country's planes) in some radical/stroke order.


> Is this ordering (on pronounciation) also OK for China and Korea?
> Or are Chinese and/or Koreanese more used to radicals/stroke count
> ordering?
> 
> Keld Simonsen

Joe wrote a bit about this too:

# Phonetic-ordered dictionaries are more common in China, since the simplified
# characters are harder to accommodate in the radical/stroke system.

I don't know about Korean. Perhaps someone else on this list would
like to comment?


Erik

