[r6rs-discuss] Lexical syntax: unicode category "Co"

John Cowan cowan at ccil.org
Thu Sep 13 09:44:35 EDT 2007


Abdulaziz Ghuloum scripsit:

> Section 4.2 Lexical syntax (page 12) defines what "constituents" are  
> by enumerating their unicode categories.  The inclusion of the  
> category "Co" is interesting since it's a unicode "private use"  
> category containing only the following markers:
> 
> E000;<Private Use, First>;Co;0;L;;;;;N;;;;;
> F8FF;<Private Use, Last>;Co;0;L;;;;;N;;;;;
> F0000;<Plane 15 Private Use, First>;Co;0;L;;;;;N;;;;;
> FFFFD;<Plane 15 Private Use, Last>;Co;0;L;;;;;N;;;;;
> 100000;<Plane 16 Private Use, First>;Co;0;L;;;;;N;;;;;
> 10FFFD;<Plane 16 Private Use, Last>;Co;0;L;;;;;N;;;;;

Note that this is a convention to keep UnicodeData a bit smaller;
the properties apply to *all* the characters in the ranges
E000-F8FF, F0000-FFFFD, and 100000-10FFFD.

-- 
John Cowan  cowan at ccil.org   http://ccil.org/~cowan
Assent may be registered by a signature, a handshake, or a click of a computer
mouse transmitted across the invisible ether of the Internet. Formality
is not a requisite; any sign, symbol or action, or even willful inaction,
as long as it is unequivocally referable to the promise, may create a contract.
       --Specht v. Netscape



More information about the r6rs-discuss mailing list