lucenenet-dev mailing list archives

From "Shad Storhaug (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (LUCENENET-551) Latin language Stemmer (feature request)
Date Sun, 23 Apr 2017 23:26:04 GMT

     [ https://issues.apache.org/jira/browse/LUCENENET-551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shad Storhaug updated LUCENENET-551:
------------------------------------
    Affects Version/s: Lucene.Net 4.8.0
                       Lucene.Net 3.0.3
          Component/s: Lucene.Net.Analysis.Common

> Latin language Stemmer (feature request)
> ----------------------------------------
>
>                 Key: LUCENENET-551
>                 URL: https://issues.apache.org/jira/browse/LUCENENET-551
>             Project: Lucene.Net
>          Issue Type: Improvement
>          Components: Lucene.Net Contrib, Lucene.Net.Analysis.Common
>    Affects Versions: Lucene.Net 3.0.3, Lucene.Net 4.8.0
>            Reporter: Peter Halasz
>
> I would find a Latin language stemmer very helpful. The Schinke Latin stemming algorithm
> has been converted to Snowball here: http://snowball.tartarus.org/otherapps/schinke/intro.html
> I have not worked out how to compile Snowball into .cs to try it.
> There are currently five Romance languages supported (French, Spanish, Portuguese, Italian,
> Romanian), so if the above doesn't work, I imagine one of these could be modified to support
> Latin.
> I realise SF.Snowball is considered a contrib package rather than core, but Lucene.Net
> seems to be the main place where Snowball stemmers are provided and maintained for C# / .NET.
> Note that ports of Snowball to other languages support Latin (using the Schinke contribution),
> such as Ruby: https://github.com/aurelian/ruby-stemmer



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
