[r6rs-discuss] R5RS is not a baseline

Robby Findler robby at eecs.northwestern.edu
Sat Feb 21 16:59:27 EST 2009


On Sat, Feb 21, 2009 at 3:24 PM, Eli Barzilay <eli at barzilay.org> wrote:
>> Sure, if you want.  Or you can use various forms of normalization,
>> some of which are standardized by Unicode and some not, to throw
>> away any unwanted distinctions.  For example, if you are analyzing
>> Chinese text, you may want to throw away the difference between
>> Simplified and Traditional characters -- not that it's trivial to do
>> so.
>
> Exactly my point -- these various forms of normalization are more
> fragile, and the selection of the normalizations you'd want to have is
> also less obvious, and they're all things that are inherently
> cultural.  So, as a hacker, I find it much easier to just ignore it
> all and look at the bits instead.  (It's convenient to have Unicode as
> a very difficult piece of work that I didn't have to deal with...)

And, for some people, I suspect that normalizing traditional into
simplified would be a political no-no (and the other way would
probably make your text hard to read for lots of people).

Robby


