Last updated at 10:34 am UTC on 16 December 2015
As of December 2015 no Unicode collation sequences have been implemented in Squeak.
Unicode collation algorithm
Note by Dale H. about implementation problems:
(Mailing list December 2015)
I think that the issue (from a performance perspective) is that you can't depend upon the value of the code point when doing collation — the main algorithm is pretty much table based — In addition to the different sort orders based on characters there are even more arcane sort rules where characters at the end of a word can affect the sort order of the word (for more info see).
It is worth looking at the Conformance section of the Unicode spec as there are different levels of collation conformance .....
ICU conforms to to UTS #10, the highest level of conformance ...
It looks like TwitterCLDR uses the Main Algorithm with tailoring. They don't claim to be conformant to the Unicode Collation Algorithm, but they are covering a big chunk of the standard use cases ....