lucenenet-dev mailing list archives

From Türkay Kaynak (JIRA) <j...@apache.org>
Subject [jira] [Comment Edited] (LUCENENET-544) Turkish stemmer causes an IndexOutOfRange
Date Thu, 24 Dec 2015 20:20:49 GMT

    [ https://issues.apache.org/jira/browse/LUCENENET-544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15070864#comment-15070864 ]

Türkay Kaynak edited comment on LUCENENET-544 at 12/24/15 8:20 PM:
-------------------------------------------------------------------

Obviously, I did not have time to understand this code :(. I investigated other ports (the
same differences are available in the Java port) and other language stemmers, and applied
this. I tested with basic texts and did not continue because of my workload...

Regards


was (Author: arionwm):
Obviously, I did not have time to understand this code :(. I investigated the original source
code (http://snowball.tartarus.org/algorithms/turkish/stem_Unicode.sbl) (the same differences
are available in the original code) and other stemmers, and applied this. I tested with basic
texts and did not continue because of my workload...

Regards

> Turkish stemmer causes an IndexOutOfRange
> -----------------------------------------
>
>                 Key: LUCENENET-544
>                 URL: https://issues.apache.org/jira/browse/LUCENENET-544
>             Project: Lucene.Net
>          Issue Type: Bug
>          Components: Lucene.Net Contrib
>         Environment: Windows 7 x64, .NET 4.0
>            Reporter: Peter Wright
>         Attachments: Program.cs, Turkish.txt, Turkish3.txt, TurkishStemmer.TemporarySolution.cs,
>                      Visual Studio with exception details.png
>
>
> Certain words passed to the Turkish stemmer cause SnowballProgram to throw an
> IndexOutOfRangeException in the method "find_among_b(Among[] v, int v_size)".
> Line 353
> diff = current[c - 1 - common] - w.s[i2];
> current[c - 1 - common]	'current[c - 1 - common]' threw an exception of type 'System.IndexOutOfRangeException'	char {System.IndexOutOfRangeException}
> -		current	{ı}	System.Text.StringBuilder
> 		Length	1	int
> 		c	8	int
> 		common	0	int
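
The debugger values above imply a read at index c - 1 - common = 7 on a buffer of Length 1. The following is only a minimal sketch of that arithmetic, not the Lucene.Net code: the class CursorBoundsSketch and both of its methods are hypothetical names introduced for illustration. It reproduces the out-of-range read on a StringBuilder and shows one possible bounds check. Why the cursor runs past the buffer in the Turkish stemmer is not established here.

```csharp
using System;
using System.Text;

// Hypothetical illustration class; not part of Lucene.Net or Snowball.
public static class CursorBoundsSketch
{
    // True when a backward read at current[c - 1 - common] would stay
    // inside the buffer. This is an assumed defensive check, not the
    // fix applied in the project.
    public static bool IsReadable(StringBuilder current, int c, int common)
    {
        int index = c - 1 - common;
        return index >= 0 && index < current.Length;
    }

    // Performs the same indexed read as the reported line and tells
    // whether it throws IndexOutOfRangeException.
    public static bool ReadThrows(StringBuilder current, int c, int common)
    {
        try
        {
            char unused = current[c - 1 - common];
            return false;
        }
        catch (IndexOutOfRangeException)
        {
            return true;
        }
    }
}
```

With the reported values (current holding the single character "ı", c = 8, common = 0), IsReadable returns false and the unguarded read throws, which matches the exception in the report.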



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
