[ https://issues.apache.org/jira/browse/LUCENENET-119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Digy updated LUCENENET-119:
---------------------------
Attachment: CharTokenizer.patch
> CharTokenizer can not detect the end of stream
> ----------------------------------------------
>
> Key: LUCENENET-119
> URL: https://issues.apache.org/jira/browse/LUCENENET-119
> Project: Lucene.Net
> Issue Type: Bug
> Environment: Lucene.Net 2.3.1
> Reporter: Digy
> Attachments: CharTokenizer.patch
>
>
> CharTokenizer can not detect the end of stream since input stream does not return -1
on EOS, instead it returns 0.
> {code}
> dataLen = input.Read(....)
> if (dataLen == -1 )
> {code}
> Analyzers that use that class (like WhitespaceAnalyzer, SimpleAnalyzer etc.) return
tokens with length 255 at the end (only last token).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
|