lucenenet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Digy" <digyd...@gmail.com>
Subject RE: [jira] Created: (LUCENENET-44) Indexing of some pdf files doesnt give desired result in ver 1.9.0.5 but works fine in ver 1.3.3.1
Date Wed, 27 Jun 2007 17:32:01 GMT
Hi Shukla,

Lucene indexes just "text" files. Therefore conversion of a pdf document(or word,excel,image
etc.) to text is not related with Lucene. Before indexing, you should convert them to text.

IFilter provides just a standard approach for this kind of conversions.

Below link may be helpful for you 
http://www.codeproject.com/csharp/IFilter.asp

DIGY



-----Original Message-----
From: shukla dhaval v (JIRA) [mailto:jira@apache.org] 
Sent: Monday, June 25, 2007 3:49 PM
To: lucene-net-dev@incubator.apache.org
Subject: [jira] Created: (LUCENENET-44) Indexing of some pdf files doesnt give desired result
in ver 1.9.0.5 but works fine in ver 1.3.3.1

Indexing of some pdf files doesnt give desired result in ver 1.9.0.5 but works fine in ver
1.3.3.1
--------------------------------------------------------------------------------------------------

                 Key: LUCENENET-44
                 URL: https://issues.apache.org/jira/browse/LUCENENET-44
             Project: Lucene.Net
          Issue Type: Bug
         Environment: .NET, Windows XP,lucene.net ver1.9.0.5
            Reporter: shukla dhaval v


Dear Sir,
 
We are using lucene.net ver. 1.9.0.5 for content searching. The problem 
we are facing is with indexing of .pdf files. We have installed the 
ifilters for pdf files. There are certain pdf files which give result 
with the older version of lucene.net 1.3.3.1 but not with the current 
one.  Please advise how to solve this issue.
 
Thank you
Dhaval Shukla
Programmer
Sansun Software Pvt Ltd
 
Product Development Division of:
Easy Data Access
5988 Mid Rivers Mall Drive
St. Charles, MO 63304
www.edausa.com


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message