lucenenet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Prescott Nasser <>
Subject Umlauts as Char
Date Tue, 08 Feb 2011 01:55:11 GMT

Hey all, 
So while digging into the code a bit (and pushed by digy's Arabic conversion yesterday). I
started looking at the various other languages we were missing from java.
I started porting the GermanAnalyzer, but ran into an issue of the Umlauts...
in the void subsitute function you'll see them:
        else if ( buffer.charAt( c ) == 'ü' ) {
          buffer.setCharAt( c, 'u' );

This does not constitue a character in .net (that I can figure out) and thus it doesn't compile.
The .java file says encoded in UTF-8. I was thinking maybe I could do the same thing in VS2010,
but I'm not finding a way, and searching on this has been difficult.
Any ideas?
View raw message