lucenenet-dev mailing list archives

From "Marco Dissel (JIRA)" <>
Subject [jira] Created: (LUCENENET-23) add c# version of to contrib section
Date Mon, 09 Oct 2006 20:37:20 GMT
add c# version of to contrib section

                 Key: LUCENENET-23
             Project: Lucene.Net
          Issue Type: New Feature
         Environment: n/a
            Reporter: Marco Dissel

I've converted the to c#... Maybe George can add this to the contrib

original file is stored at :

Fuzzifies ALL terms provided as strings and then picks the best n differentiating terms.
In effect this mixes the behaviour of FuzzyQuery and MoreLikeThis but with special consideration
of fuzzy scoring factors.
This generally produces good results for queries where users may provide details in a number
of fields, have no knowledge of boolean query syntax, and also want a degree of fuzzy matching
and a fast query.
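
As a rough illustration of the "picks the best n" step, here is a minimal Java sketch that keeps the n highest-scoring variants in a bounded min-heap. The class TopVariants, the Variant type and the score values are hypothetical and only serve the example; how the contributed class actually scores and stores its variants is not shown in this message.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

// Hypothetical illustration only; not the contributed class.
class TopVariants {
    // A fuzzy variant of a source term with an illustrative score (assumed input).
    static final class Variant {
        final String text;
        final double score;

        Variant(String text, double score) {
            this.text = text;
            this.score = score;
        }
    }

    // Returns the n highest-scoring candidates, best first.
    static List<Variant> pickBest(List<Variant> candidates, int n) {
        // Min-heap on score: the head is always the weakest variant kept so far.
        PriorityQueue<Variant> heap =
            new PriorityQueue<>(Comparator.comparingDouble((Variant v) -> v.score));
        for (Variant v : candidates) {
            heap.offer(v);
            if (heap.size() > n) {
                heap.poll(); // drop the current lowest score
            }
        }
        List<Variant> best = new ArrayList<>(heap);
        best.sort(Comparator.comparingDouble((Variant v) -> v.score).reversed());
        return best;
    }
}
```

The heap never holds more than n + 1 entries, so the cost stays small even when every source term expands into many variants.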

For each source term the fuzzy variants are held in a BooleanQuery with no coord factor (because
we are not looking for matches on multiple variants in any one doc). Additionally, a specialized
TermQuery is used for variants and does not use that variant term's IDF because this would
favour rarer terms, e.g. misspellings. Instead, all variants use the same IDF ranking (the one
for the source query term) and this is factored into the variant's boost. If the source query
term does not exist in the index, the average IDF of the variants is used.

@author maharwood
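
The shared-IDF rule described above can be sketched in a few lines of Java. This is only an illustration under stated assumptions: the class SharedIdf and its method names are hypothetical, the document frequencies are passed in as plain numbers instead of being read from an index, and the formula 1 + ln(numDocs / (docFreq + 1)) is assumed here as the classic Lucene-style IDF.

```java
// Hypothetical illustration of the shared-IDF rule; not the contributed class.
class SharedIdf {
    // Assumed classic Lucene-style IDF: 1 + ln(numDocs / (docFreq + 1)).
    static double idf(int docFreq, int numDocs) {
        return 1.0 + Math.log(numDocs / (double) (docFreq + 1));
    }

    // IDF applied to every fuzzy variant of one source term.
    static double sharedIdf(int sourceDocFreq, int[] variantDocFreqs, int numDocs) {
        if (sourceDocFreq > 0) {
            // Source term is in the index: all variants borrow its IDF,
            // so a rare misspelling gets no advantage.
            return idf(sourceDocFreq, numDocs);
        }
        if (variantDocFreqs.length == 0) {
            return 0.0; // nothing to score (a choice made for this sketch)
        }
        // Source term is missing from the index: fall back to the average variant IDF.
        double sum = 0.0;
        for (int docFreq : variantDocFreqs) {
            sum += idf(docFreq, numDocs);
        }
        return sum / variantDocFreqs.length;
    }
}
```

With this rule a variant that appears in only one document scores no higher on IDF than the correctly spelled source term, which is the effect the description is after.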

P.S. There's no Java test class...



This message is automatically generated by JIRA.