lucenenet-dev mailing list archives

From "michal dia (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (LUCENENET-544) Turkish stemmer causes an IndexOutOfRange
Date Wed, 23 Dec 2015 08:52:46 GMT

    [ https://issues.apache.org/jira/browse/LUCENENET-544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15069396#comment-15069396 ]

michal dia edited comment on LUCENENET-544 at 12/23/15 8:52 AM:
----------------------------------------------------------------

I have the same problem. Is there a fix for this?

To reproduce, create a C# console application with the following code:
{code}
TurkishStemmer ts = new TurkishStemmer();
ts.SetCurrent("faydaland");
ts.Stem();
// throws an IndexOutOfRangeException
{code}

We noticed that words longer than 2 characters that end with 'd' or 'g' throw an index
out of range exception in the method "find_among_b(Among[] v, int v_size)".


> Turkish stemmer causes an IndexOutOfRange
> -----------------------------------------
>
>                 Key: LUCENENET-544
>                 URL: https://issues.apache.org/jira/browse/LUCENENET-544
>             Project: Lucene.Net
>          Issue Type: Bug
>          Components: Lucene.Net Contrib
>         Environment: Windows 7 x64, .NET 4.0
>            Reporter: Peter Wright
>         Attachments: Program.cs, Turkish.txt, Turkish3.txt, Visual Studio with exception details.png
>
>
> Certain words in the Turkish stemmer are causing SnowballProgram to throw an index out of range exception in the method "find_among_b(Among[] v, int v_size)".
> Line 353:
> diff = current[c - 1 - common] - w.s[i2];
> Debugger values at the point of failure (name, value, type):
> current[c - 1 - common]    'current[c - 1 - common]' threw an exception of type 'System.IndexOutOfRangeException'    char {System.IndexOutOfRangeException}
> current    {ı}    System.Text.StringBuilder
>   Length   1      int
> c          8      int
> common     0      int
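
For reference, the debugger values quoted above (Length 1, c 8, common 0) can be checked in isolation. The following is only a minimal sketch and not code from Lucene.Net: the class name FindAmongBIndexCheck and the helper IsReadInRange are hypothetical names made up for illustration. It evaluates the same index expression as line 353, current[c - 1 - common], against a one-character StringBuilder and shows that index 7 lies outside the buffer. It does not explain why the cursor c ends up larger than the buffer length.

```csharp
using System;
using System.Text;

public static class FindAmongBIndexCheck
{
    // Hypothetical helper (not part of Lucene.Net): reports whether the
    // expression current[c - 1 - common] would read inside the buffer.
    public static bool IsReadInRange(StringBuilder current, int c, int common)
    {
        int index = c - 1 - common;
        return index >= 0 && index < current.Length;
    }

    public static void Main()
    {
        // Values copied from the debugger output in the issue description.
        var current = new StringBuilder("ı"); // Length is 1
        int c = 8;
        int common = 0;

        // Index is 8 - 1 - 0 = 7, which is past the end of a 1-character buffer.
        Console.WriteLine("In range: " + IsReadInRange(current, c, common));

        try
        {
            // Same read as on line 353 of the reported stack location.
            char ch = current[c - 1 - common];
            Console.WriteLine("Read: " + ch);
        }
        catch (IndexOutOfRangeException)
        {
            Console.WriteLine("IndexOutOfRangeException at index " + (c - 1 - common));
        }
    }
}
```

A bounds check like this only confirms the symptom. Whether the right fix belongs in find_among_b or in the Turkish stemmer rules that move the cursor is not something the values above can settle.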



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)