Hi All this is an article on unicode I found when cruising through Slashdot.org. It's a bit old 2001, but an interesting read if you do multiple language sites.
Reminded me of a big debate went on between an early unicode adoptation and national codepage supportes in Japan.... the issue presented is not so much unicode but the way basic code page is structured.... unicode does not support "gaiji" (external charsets) in Japanese which are old/classic Chinese charsets used in Japan... all non-standard Toyo kanji (standard Japanese) are written/developed/mapped into "gaiji" (external charset) tables of code pages... You may find the same heated discussion of design issues for code pages in late 70s and early 80s when unicode started to be proposed as an alternate "multilingual" approach to "integrate" CJK "national" code pages into a single common codepage/charset...
Unicode is also still in an evolutionary phase in handling all these issues, I assume, it is not all mightly single solution for everything for every language... but still is the best environment for "multilingual" sites on a common or single encoding/charset architecture....
Pure personal observation... some interesting reading there in Slashdot.org