lucenenet-dev mailing list archives

From "Satish P Wani (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENENET-386) Stop words in Unicode/UTF8 format is not working with Lucene .Net 2.9.2
Date Sat, 18 Dec 2010 10:24:01 GMT
Stop words in Unicode/UTF8 format is not working with Lucene .Net 2.9.2
-----------------------------------------------------------------------

                 Key: LUCENENET-386
                 URL: https://issues.apache.org/jira/browse/LUCENENET-386
             Project: Lucene.Net
          Issue Type: Wish
         Environment: Windows XP, Lucene .Net
            Reporter: Satish P Wani


We are using Lucene.Net version 2.9.2. We tried supplying Unicode/UTF-8 stop words (words in
Devanagari) in the following three ways. None of them works: the stop words are not filtered
out. Please let us know whether anyone has tried this.

// Case 1: pass the stop-word file directly to the analyzer
string StopWordsFilePath = "D:\\lucene\\stopword.txt";
FileInfo StopWordsList = new FileInfo(StopWordsFilePath);
StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_CURRENT,
    StopWordsList);
IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);

// Case 2: read the stop words ourselves as UTF-8 and pass them as an array
string StopWordsFilePath = "D:\\lucene\\stopword.txt";   // The file is saved as UTF-8
string[] stopWords = File.ReadAllLines(StopWordsFilePath, Encoding.UTF8);
StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(stopWords);
IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);

// Case 3: read the stop words ourselves as Unicode (UTF-16) and pass them as an array
string StopWordsFilePath = "D:\\lucene\\stopword.txt";  // The file is saved as Unicode (UTF-16)
string[] stopWords = File.ReadAllLines(StopWordsFilePath, Encoding.Unicode);
StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(stopWords);
IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag);

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.