lucenenet-dev mailing list archives

From "Granroth, Neal V." <>
Subject RE: Question
Date Thu, 07 Jan 2010 19:36:42 GMT
I am very surprised by this comment.
There is so much similarity between Java and C# that I found absolutely no difficulty with
the discussion and examples in "Lucene in Action" and in directly applying the techniques
to my C#/.NET projects.

Maybe it would be helpful if those who find the Java examples confusing could explain
specifically why they are confusing.  Then we might consider putting together some type of
short "Guide to understanding Lucene for C# developers" or FAQ on the web site.

- Neal

-----Original Message-----
From: Ed Jones [] 
Sent: Thursday, January 07, 2010 3:57 AM
Subject: RE: Question

All I can say is that we found the lack of .NET examples problematic: when you are not
up to speed with Java, there are a lot of basic hurdles to overcome.

-----Original Message-----
From: Olivier Spinelli [] 
Sent: 07 January 2010 09:55
Subject: RE: Question

Lucene.Net sticks to the APIs and classes used in the original Java
implementation of Lucene. The API names as well as class names are preserved
with the intention of giving Lucene.Net the look and feel of the C# language
and the .NET Framework. For example, the method Hits.length() in the Java
implementation now reads Hits.Length() in the C# port. 

In addition to the port of the APIs and classes to C#, the algorithms of Java
Lucene are ported to C# Lucene. This means an index created with Java Lucene is
back-and-forth compatible with C# Lucene for reading, writing, and
updating. In fact a Lucene index can be concurrently searched and updated
using Java Lucene and C# Lucene processes.

It is mostly a matter of switching from camelCase to PascalCase...
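
To make the naming point concrete, here is a minimal sketch in Java (the language the book's examples use) of the casing change described above. The class NamingSketch and its method toPascalCase are hypothetical helpers written for illustration; they are not part of Lucene or Lucene.Net. Only Hits.length() comes from the message above; addDocument is just an illustrative member name, and the real port may differ in places from a purely mechanical rename.

```java
// Hypothetical helper for illustration only; not part of Lucene or Lucene.Net.
final class NamingSketch {

    // Upper-case the first character of a Java-style (camelCase) member name,
    // which is the casing change the message describes, e.g. length -> Length.
    static String toPascalCase(String javaName) {
        if (javaName == null || javaName.isEmpty()) {
            return javaName;
        }
        return Character.toUpperCase(javaName.charAt(0)) + javaName.substring(1);
    }

    public static void main(String[] args) {
        // "length" is taken from the thread; "addDocument" is an assumed example name.
        String[] javaNames = {"length", "addDocument"};
        for (String name : javaNames) {
            System.out.println(name + "() in Java reads as " + toPascalCase(name) + "() in C#");
        }
    }
}
```

When reading a Java example, applying this rename to each method call is usually a reasonable first guess at the C# spelling, which can then be checked against the Lucene.Net sources.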



-----Original Message-----
From: Ed Jones [] 
Sent: Thursday, 7 January 2010 10:27
To:
Subject: RE: Question

My problem with Lucene in Action and all the examples on the internet is
that they were all in Java, and you have to understand exactly what Java
is doing to understand it all properly. It's for this very reason we had
to shun using it in major projects. I wanted dearly to use it,
but the learning curve was far too steep and there appear to be
very few .NET code examples or other help.

Instead we have invested a significant amount of money in buying in a
much more commercial search engine.

I am keeping an eye on the project though in case it can be
used in other parts of our business, but again the same will apply: we
will need more non-Java examples.


-----Original Message-----
From: Roger Chapman [] 
Sent: 07 January 2010 09:21
Subject: RE: Question

From what I can remember the book Lucene in Action has a good section on
indexing documents and PDFs.


-----Original Message-----
From: Ben Martz []
Sent: 06 January 2010 19:51
Cc: <>
Subject: Re: Question


I would definitely take Michael's advice to learn more about the
overall issue before you get too far.

A quick answer that may help is Windows does not ship with an iFilter
for PDF built-in. Installing Adobe Reader 8 or higher will install a
decent PDF iFilter.

I am a little surprised by your question though - I assume that you
have access to your own source code and could examine the result from
the iFilter that's being fed to the IndexWriter and compare the
behavior in the TXT case with the behavior in the PDF case?
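
The comparison suggested above can be sketched as a small check on whatever text the extraction step hands to the indexer. This is an illustrative sketch in plain Java with no Lucene or iFilter calls; ExtractionCheck, describe, the file names, and the sample strings are all assumptions made up for the example. The idea is simply that if the PDF path yields empty text, nothing reaches the index and no search can match.

```java
import java.util.Locale;

// Hypothetical diagnostic for illustration only; it does not call Lucene or any iFilter.
final class ExtractionCheck {

    // Summarise the text an extraction step produced for one file,
    // and whether a given search term appears in it (case-insensitive).
    static String describe(String fileName, String extractedText, String term) {
        if (extractedText == null || extractedText.trim().isEmpty()) {
            return fileName + ": no text extracted, nothing to index";
        }
        boolean found = extractedText.toLowerCase(Locale.ROOT)
                .contains(term.toLowerCase(Locale.ROOT));
        return fileName + ": " + extractedText.length() + " chars, term "
                + (found ? "present" : "absent");
    }

    public static void main(String[] args) {
        // Assumed sample inputs: a text file that extracts fine and a PDF that yields nothing.
        System.out.println(describe("notes.txt", "Lucene indexes plain text", "lucene"));
        System.out.println(describe("report.pdf", "", "lucene"));
    }
}
```

Logging a summary like this for one TXT file and one PDF file, just before the text is added to the index, would show whether the problem lies in extraction or in the search itself.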



Sent from my iPhone

On Jan 6, 2010, at 10:13, Michael Garski <>

> Todd,
>
> You'll need some way to extract the text from the PDF prior to
> indexing.  I'm not familiar with any packages that can do that but I
> have heard of them.  You may want to try searching the mailing list
> to see if there has been mention of one previously.  Lucid
> Imagination hosts a great mailing list search tool at
>
> Michael
>
> -----Original Message-----
> From: Todd McIndoo []
> Sent: Wednesday, January 06, 2010 10:11 AM
> To:
> Subject: Question
>
> Sorry if this is duplicate
>
> We are using of version I am trying to search a document
> which contains lots of PDFs. I want to search a document, which
> contains a specific word, using We are yielding results in text
> documents but not in PDF. Is there something we have to do to be able to
> search in PDF Documents. All ifilters have been installed on the
> computer so I do not think that is the issue.
>
> Regards,
>
> Todd McIndoo
