lucenenet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From juliya james <juliy...@yahoo.co.in.INVALID>
Subject Bigger index size produced with Lucene .Net 4.8.0 if compared with equivalent Java version
Date Thu, 05 Apr 2018 04:09:33 GMT
Hi,
The measurement of index sizes generated by Lucene .Net 4.8_beta005 shows that, it is bigger(~20-30%),
compared to the index created by Lucene Java 4.8.0, as can be seen in the table below.
Are there any known concerns related to the index size with Lucene .Net 4.8.0?

                                  
| Index Size in MB withLucene .Net 4.8_beta005 | Index Size in MB withLucene Java 4.8.0 |
| 5.4 | 5 |
| 27.46 | 25 |
| 41.32 | 32 |
| 47.66 | 45 |
| 95.3 | 90 |
| 238.14 | 221 |
| 476.4 | 385 |



Note: -  IndexWriter configuration and the quantity of the data which was indexed(input)
were the same for the measurements, so that the resultant index sizes can be compared. -
While indexing, data was split to several Lucene documents and each document may have ~1MB
of data. Most of the data was indexed with the field property   [Field.Store.NO, Field.Index.ANALYZED_NO_NORMS]
Thanks & Regards,Juliya

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message