lucenenet-dev mailing list archives

From NightOwl888 <...@git.apache.org>
Subject [GitHub] lucenenet issue #182: Analysis Missing Tests and Bug Fixes
Date Wed, 24 Aug 2016 19:46:20 GMT
Github user NightOwl888 commented on the issue:

    https://github.com/apache/lucenenet/pull/182
  
    Unfortunately, the ignore comments are only in the .NET version.
    
    FYI - I got most of the Thai support working. However, there seems to be another issue,
caused directly by the BreakIterator, that makes one test fail: it doesn't break combinations
of Thai characters and English into separate words as the tests expect.

    
    I tried to hack around it by adding a PadThai method that preprocesses the input (not the
tasty dish; it adds spaces between Thai letters and non-Thai letters). Unfortunately, that
throws off the offset math. I might be able to fix that, but this really doesn't seem like the
right way to do it (not to mention, it is ambiguous what to do when Thai and Roman numbers are
mixed). Any chance we can get a patched version of ICU? Or do you have any other ideas for how
to address this?
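
    To illustrate the offset problem, here is a minimal sketch of the padding idea. It is
written in Python for brevity and is not the code from the pull request: `pad_thai`,
`is_thai`, and `to_original` are hypothetical names, and treating everything in the Thai
Unicode block (U+0E00 to U+0E7F, which includes Thai digits) as "Thai" is an assumption.
The sketch records, for each position in the padded text, the corresponding position in the
original text, so token offsets can be mapped back after the inserted spaces shift them.

```python
def is_thai(ch: str) -> bool:
    # Assumption: "Thai" means any code point in the Thai Unicode block.
    return "\u0e00" <= ch <= "\u0e7f"


def pad_thai(text: str):
    """Insert a space at each Thai/non-Thai boundary.

    Returns the padded text plus a list where offsets[i] is the index in
    the original text that position i of the padded text corresponds to.
    A final sentinel entry maps the end of the padded text.
    """
    padded = []
    offsets = []
    prev = None
    for i, ch in enumerate(text):
        at_boundary = (
            prev is not None
            and not prev.isspace()
            and not ch.isspace()
            and is_thai(prev) != is_thai(ch)
        )
        if at_boundary:
            padded.append(" ")
            offsets.append(i)  # the inserted space has no original position
        padded.append(ch)
        offsets.append(i)
        prev = ch
    offsets.append(len(text))  # end-of-text sentinel
    return "".join(padded), offsets


def to_original(start: int, end: int, offsets):
    # Map a [start, end) span in the padded text back to the original text.
    return offsets[start], offsets[end]


padded, offsets = pad_thai("ไทยabc")
start = padded.index("abc")  # position in the padded text
end = start + len("abc")
print((start, end), "->", to_original(start, end, offsets))
```

    Without the mapping, the span reported for "abc" is shifted by one for every space
inserted before it, which is the kind of offset error described above. The sketch does not
resolve the ambiguity around mixed Thai and non-Thai digits; it simply pads every boundary.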
    
    Anyway, the latest commit reduces the number of failing tests to 28.
    
    



