lucenenet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Itamar Syn-Hershko (JIRA)" <>
Subject [jira] [Commented] (LUCENENET-551) Latin language Stemmer (feature request)
Date Mon, 02 Feb 2015 16:58:35 GMT


Itamar Syn-Hershko commented on LUCENENET-551:

We are currently in the process of porting Lucene 4.8.0. Once we are done we will have plenty
of new languages supported:

However, it doesn't seem like this Latin analyzer is supported. When we get to that stage
I will look into it.

> Latin language Stemmer (feature request)
> ----------------------------------------
>                 Key: LUCENENET-551
>                 URL:
>             Project: Lucene.Net
>          Issue Type: Improvement
>          Components: Lucene.Net Contrib
>            Reporter: Peter Halasz
> I would find a Latin language stemmer very helpful. The Schinke Latin stemming algorithm
has been converted to Snowball here:
. I have not worked out how to compile Snowball into .cs to try it.
> There are currently 5 romance-languages supported (French, Spanish, Portuguese, Italian,
Romanian). so if the above doesn't work, I imagine one of these could be modified to support
> I realise SF.Snowball is considered a contrib package rather than core, but Lucene.Net
seems to be the main place where Snowball stemmers are provided and maintained for C# / .Net.
> Note, other language ports of Snowball support Latin (using the Schinke contribution),
such as Ruby:

This message was sent by Atlassian JIRA

View raw message