ietf-nntp Wildmats

Andrew Gierth andrew at erlenstar.demon.co.uk
Wed Nov 22 11:57:00 PST 2000


>>>>> "Charles" == Charles Lindsey <chl at clw.cs.man.ac.uk> writes:

 [Clive]
 >> "a"           is a character
 >> "\u0061"      is a 6 character sequence representing it
 >> %xC0 %xA3     is the UTF-encoding of the pound sterling sign which
 >> is a single character

I think you mean %xC2 %xA3 (there are no valid UTF-8 sequences
starting with %xC0 or %xC1, since those lead octets could only yield
overlong encodings of ASCII characters).
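
A quick way to check this is to hand the octets to a strict UTF-8
decoder. The following is only a sketch using Python's built-in codec;
the helper name decodes_as_utf8 is made up for illustration.

```python
def decodes_as_utf8(octets: bytes) -> bool:
    """Return True if the octets form valid UTF-8 under Python's strict decoder."""
    try:
        octets.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# C2 A3 is the well-formed encoding of the pound sterling sign.
pound_ok = decodes_as_utf8(b"\xC2\xA3")

# C0 and C1 are never valid lead octets: C0 A3 would be an overlong
# form of 0x23, and C1 A3 an overlong form of 0x63.
c0_ok = decodes_as_utf8(b"\xC0\xA3")
c1_ok = decodes_as_utf8(b"\xC1\xA3")

print(pound_ok, c0_ok, c1_ok)
```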

 >> "\u00A3"      is a 6 character sequence representing it

 Charles> ITYM "\uC0A3"

No, he meant \u00A3.

The UCS-4 code for the GBP sign is 000000A3, which encodes in UTF-8 as
the two octets C2 A3 (binary 110[00010] 10[100011]; the bracketed
payload bits concatenate to 00010100011, i.e. hex A3).
The \u escape encodes the original character's code point, not its
UTF-8 representation.
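
The bit arithmetic above can be sketched in Python as follows. This is
only an illustration of the two-octet case (code points 0x80 through
0x7FF); the function name encode_two_octets is made up here, and the
result is compared against Python's own UTF-8 codec.

```python
def encode_two_octets(code_point: int) -> bytes:
    """Encode a code point in the range 0x80..0x7FF as two UTF-8 octets."""
    if not 0x80 <= code_point <= 0x7FF:
        raise ValueError("code point is outside the two-octet UTF-8 range")
    # Lead octet: marker bits 110 followed by the top 5 payload bits.
    lead = 0b11000000 | (code_point >> 6)
    # Trail octet: marker bits 10 followed by the low 6 payload bits.
    trail = 0b10000000 | (code_point & 0b00111111)
    return bytes([lead, trail])


pound = "\u00A3"                      # the escape names the character itself
manual = encode_two_octets(ord(pound))
builtin = pound.encode("utf-8")

print(len(pound), manual.hex(), builtin.hex())
```

Note that the escape yields a single character; the two octets only
appear once that character is encoded.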

-- 
Andrew.


