lucenenet-dev mailing list archives

From "Ben Martz (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENENET-350) Performance enhancement in FastVectorHighlighter
Date Wed, 17 Mar 2010 19:35:27 GMT

    [ https://issues.apache.org/jira/browse/LUCENENET-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846545#action_12846545 ]

Ben Martz commented on LUCENENET-350:
-------------------------------------

Agreed. I was just concerned about code divergence, given the core guideline of remaining
1:1 with the original Java code, while still regarding this contribution as a valuable (and
unique to Lucene.Net) branch of an existing contrib item. I'm at a loss for a better idea
right now, though.

> Performance enhancement in FastVectorHighlighter
> ------------------------------------------------
>
>                 Key: LUCENENET-350
>                 URL: https://issues.apache.org/jira/browse/LUCENENET-350
>             Project: Lucene.Net
>          Issue Type: Improvement
>            Reporter: Bianco Veigel
>         Attachments: FieldTermStack.patch, FieldTermStack.patch, VectorHighlightMapper.cs
>
>
> I've had some performance issues with highlighting large documents (>25MB plain text,
> 11000 terms per field).
> This can be common if you're indexing, e.g., log or trace files.
> Most of the time is spent loading the field value and the stored TermVectors and offsets, and
> iterating over this list.
> I've built a TermVectorMapper which filters this list by the searched terms, so that
> the time is reduced by approx. 30%.
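
The filtering idea in the quoted description can be illustrated with a minimal, standalone sketch. It is not the attached VectorHighlightMapper.cs and does not use the Lucene or Lucene.Net API; the class QueryTermFilter, its Entry type, and the simplified map(term, positions) callback are hypothetical names chosen for illustration. The point is only that entries for terms outside the query are discarded as they arrive, so later highlighting steps iterate over a much shorter list.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for a term-vector mapper callback; not the Lucene.Net API.
public class QueryTermFilter {
    // One retained term-vector entry: the term plus its positions in the field.
    public static final class Entry {
        public final String term;
        public final int[] positions;

        Entry(String term, int[] positions) {
            this.term = term;
            this.positions = positions;
        }
    }

    private final Set<String> queryTerms;
    private final List<Entry> kept = new ArrayList<>();

    public QueryTermFilter(Set<String> queryTerms) {
        this.queryTerms = queryTerms;
    }

    // Called once per term of the stored vector; terms that are not
    // part of the query are dropped here instead of being collected.
    public void map(String term, int[] positions) {
        if (queryTerms.contains(term)) {
            kept.add(new Entry(term, positions));
        }
    }

    // Entries that survived the filter, in arrival order.
    public List<Entry> kept() {
        return kept;
    }

    public static void main(String[] args) {
        QueryTermFilter filter =
            new QueryTermFilter(new HashSet<>(Arrays.asList("error", "timeout")));
        // Simulate a reader pushing every term of a large field through the callback.
        filter.map("info", new int[] {0, 4});
        filter.map("error", new int[] {1, 7});
        filter.map("debug", new int[] {2});
        filter.map("timeout", new int[] {3});
        for (Entry e : filter.kept()) {
            System.out.println(e.term + " " + Arrays.toString(e.positions));
        }
    }
}
```

With roughly 11000 terms per field and only a handful of query terms, a filter of this shape would keep the collected list small; the 30% figure quoted above is the reporter's own measurement and is not reproduced by this sketch.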

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

