lucenenet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stefan Bodewig <bode...@apache.org>
Subject Re: Umlauts as Char
Date Tue, 08 Feb 2011 05:01:27 GMT
On 2011-02-08, Nicholas Paldino [.NET/C# MVP] wrote:

> 	You can simply use the Unicode escape sequence in code and in
> string/character literals, as specified by section 2.4.2 of the C# spec
> (http://msdn.microsoft.com/en-us/library/aa664670(v=vs.71).aspx):

I think in Prescott's case part of the problem is that he doesn't know
which character the sequence seems to be.  In this case it likely is an
ü.

>         else if ( buffer.charAt( c ) == 'ü' ) {
>           buffer.setCharAt( c, 'u' );
>         }

> 	Would become:

>         else if ( buffer.charAt( c ) == '\u00C3¼' ) {
>           buffer.setCharAt( c, 'u' );
>         }

No.  The two bytes are part of a two byte UTF-8 sequence making up a
single character.

Stefan

Mime
View raw message