lucenenet-dev mailing list archives

From Erik Hatcher <e...@ehatchersolutions.com>
Subject Re: Need some help understanding what the "StandardAnalyzer" is doing here ...
Date Mon, 06 Nov 2006 14:57:46 GMT

On Nov 6, 2006, at 8:37 AM, Andy Berryman wrote:

> I have an index with a Field named "SKU" which is a "Text" type. I'm using
> the "StandardAnalyzer" for indexing and searching. I'm using "Luke"
> (http://www.getopt.org/luke/luke.jnlp) to do some testing for this problem
> and to allow me to see how Lucene is parsing the query etc. If I provide
> the search expression as ... *SKU:andyb-test-item-001* ... Lucene is parsing
> that to ... *SKU:"andyb test item-001"*. So my question is ... Why are the
> dashes between "andyb", "test", and "item" being removed but not the one
> between "item" and "001"?

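The StandardAnalyzer is designed to attempt to be clever with part
numbers, IDs, and such that intermix alphas and numerics, like R2D2
and C-3P0. In your example, "item-001" contains digits, so it is kept
together as a single token, while "andyb" and "test" are purely
alphabetic and get split at the dashes.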

	Erik
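
The grouping described above can be illustrated with a small sketch. This is not Lucene's tokenizer or its grammar. It is a rough approximation in Python, under the assumption that hyphen-separated segments stay joined only when every other segment in the run contains a digit. The function names `has_digit` and `approx_tokenize` are hypothetical, and the sketch ignores whitespace, stop words, and every other punctuation rule.

```python
def has_digit(segment):
    """Return True if the segment contains at least one digit."""
    return any(ch.isdigit() for ch in segment)


def approx_tokenize(term):
    """Approximate how a hyphenated term is grouped into tokens.

    Assumption (not taken from the Lucene source): a run of segments is
    kept together when every other segment in the run contains a digit.
    Anything else is split at the hyphen.
    """
    # Lowercase first, since StandardAnalyzer also lowercases its tokens.
    segments = [s for s in term.lower().split("-") if s]
    tokens = []
    i = 0
    while i < len(segments):
        best = 1  # a lone segment is always a valid token
        # Try both alignments: digits required at even or at odd offsets.
        for parity in (0, 1):
            n = 0
            while i + n < len(segments) and (
                n % 2 != parity or has_digit(segments[i + n])
            ):
                n += 1
            best = max(best, n)
        tokens.append("-".join(segments[i:i + best]))
        i += best
    return tokens


print(approx_tokenize("andyb-test-item-001"))
print(approx_tokenize("C-3P0"))
```

Under this assumption, "andyb" and "test" come out as separate tokens because neither neighbor contains a digit, while "item-001" stays whole. If SKUs need to match exactly, a common alternative is to index the field untokenized rather than rely on this behavior.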

