ietf-nntp wildmat routines and text

Martin J. Duerst duerst at w3.org
Wed Jul 26 19:37:01 PDT 2000


At 00/07/26 18:51 +0100, Clive D.W. Feather wrote:
>Andrew Gierth said:
> > One additional comment on wildmat syntax: especially when handling
> > UTF-8, it would be useful to support numeric \-escapes (e.g. \040
> > or \x1234), even in []-sets.
>
>If you're going to do that, can I suggest that you steal the syntax from
>C99:
>
>     \uxxxx           means U+0000xxxx in the ISO 10646 character set
>     \Uxxxxxxxx       means U+xxxxxxxx in the ISO 10646 character set
>
>(the xs are hexadecimal digits).

I suggest to change that to e.g. \uxxxx and \Uxxxxxx, i.e. only
six digits for the second one. Both the Unicode consortium and
ISO/IEC JTC1/WG2/SC2 have agreed to not encode any characters
beyond \U10FFFF, so defining eight digits is clear overkill.


Regards,  Martin.




More information about the ietf-nntp mailing list