lucenenet-dev mailing list archives

From "Shad Storhaug (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (LUCENENET-547) Replace Spanish suffixes by Portuguese suffixes in the Portuguese snowball stemmer
Date Wed, 28 Jun 2017 19:14:00 GMT

     [ https://issues.apache.org/jira/browse/LUCENENET-547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shad Storhaug updated LUCENENET-547:
------------------------------------
    Component/s: Lucene.Net.Analysis.Common

> Replace Spanish suffixes by Portuguese suffixes in the Portuguese snowball stemmer
> ----------------------------------------------------------------------------------
>
>                 Key: LUCENENET-547
>                 URL: https://issues.apache.org/jira/browse/LUCENENET-547
>             Project: Lucene.Net
>          Issue Type: Bug
>          Components: Lucene.Net Contrib, Lucene.Net.Analysis.Common
>            Reporter: Helder
>              Labels: stemmer
>
> PortugueseStemmer.cs [1] contains a few suffixes that I believe were copied by mistake from SpanishStemmer.cs [2]:
> * "log\u00EDas" should be "logias" (line 137)
> * "log\u00EDa" should be "logia" (line 113)
> * "uciones" should be "uções" (line 139)
> * "uci\u00F3n" should be "ução" (line 120)
> For more details, see the original report in the NLTK project:
> https://github.com/nltk/nltk/issues/754
> [1] https://github.com/apache/lucene.net/blob/master/src/contrib/Snowball/SF/Snowball/Ext/PortugueseStemmer.cs
> [2] https://github.com/apache/lucene.net/blob/master/src/contrib/Snowball/SF/Snowball/Ext/SpanishStemmer.cs
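
To make the effect of the four reported suffixes concrete, here is a minimal Python sketch. It is not the Snowball algorithm and not the Lucene.Net code: it only checks plain suffix matches. The names `SUFFIX_FIXES` and `longest_suffix` are hypothetical and exist only for this illustration; the suffix pairs themselves are the ones listed in the report.

```python
# Illustration only: plain suffix matching, not the real Snowball stemmer.
# Keys are the Spanish-looking suffixes reported in the issue; values are
# the Portuguese suffixes the reporter proposes instead.
SUFFIX_FIXES = {
    "log\u00EDas": "logias",          # "logías" -> "logias"
    "log\u00EDa": "logia",            # "logía"  -> "logia"
    "uciones": "u\u00E7\u00F5es",     # "uciones" -> "uções"
    "uci\u00F3n": "u\u00E7\u00E3o",   # "ución"  -> "ução"
}


def longest_suffix(word, suffixes):
    """Return the longest entry of `suffixes` that `word` ends with, or None."""
    matches = [s for s in suffixes if word.endswith(s)]
    return max(matches, key=len) if matches else None


spanish_like = list(SUFFIX_FIXES.keys())
portuguese = list(SUFFIX_FIXES.values())

for word in ("tecnologias", "solu\u00E7\u00F5es"):
    print(word,
          "| Spanish-like table:", longest_suffix(word, spanish_like),
          "| Portuguese table:", longest_suffix(word, portuguese))
```

With the Spanish-looking entries, ordinary Portuguese words such as "tecnologias" and "soluções" match none of the four suffixes, whereas the proposed Portuguese entries do match them. That is the practical consequence the report points at, under the simplifying assumption that a suffix rule only fires when the word ends with it.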



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
