lucenenet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Helder (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENENET-547) Replace Spanish suffixes by Portuguese suffixes in the Portuguese snowball stemmer
Date Thu, 16 Oct 2014 11:36:33 GMT
Helder created LUCENENET-547:
--------------------------------

             Summary: Replace Spanish suffixes by Portuguese suffixes in the Portuguese snowball
stemmer
                 Key: LUCENENET-547
                 URL: https://issues.apache.org/jira/browse/LUCENENET-547
             Project: Lucene.Net
          Issue Type: Bug
            Reporter: Helder


On PortugueseStemmer.cs[1], there are a few suffixes in the PortugueseStemmer which I believe
were copied by mistake from SpanishStemmer[2]:

* "log\u00EDas" should be "logias" (line 137)
* "log\u00EDa" should be "logia" (line 113)
* "uciones" should be "uções" (line 139)
* "uci\u00F3n" should be "ução" (line 120)

For more details, see the original report on nltk project:
https://github.com/nltk/nltk/issues/754

[1] https://github.com/apache/lucene.net/blob/master/src/contrib/Snowball/SF/Snowball/Ext/PortugueseStemmer.cs

[2] https://github.com/apache/lucene.net/blob/master/src/contrib/Snowball/SF/Snowball/Ext/SpanishStemmer.cs



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message