lucenenet-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From huân bùi (JIRA) <j...@apache.org>
Subject [jira] [Updated] (LUCENENET-604) Search text without accent to get text has accent
Date Fri, 31 Aug 2018 11:06:00 GMT

     [ https://issues.apache.org/jira/browse/LUCENENET-604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

huân bùi updated LUCENENET-604:
-------------------------------
    Description: 
I made the index data that is the vietnamese characters:
{code:java}
{Id="zgRpBGVKTEqBWkAEDkApig",Name="[Bình chọn 2 câu hỏi] - Mesut Ozil & Jack Ruddy",Desc="[Bình
chọn 2 câu hỏi] - Mesut Ozil & Jack Ruddy"},
{Id="T8t5PKAng0aDwNPBTKR-Hw",Name="[Bình chọn 2 câu hỏi] - Ben Hamer",Desc="[Bình chọn
2 câu hỏi] - Ben Hamer"},
{Id="84o_eBV9cU-OIUeC8RpTaQ",Name="[Top 5 trận đấu gần đây nhất] - Fulham - Burnley",Desc="[Top
5 trận đấu gần đây nhất] - Fulham - Burnley"},
{Id="Ch34ACvS2km-gIhHtHWcaA",Name="[Lịch thi đấu] - Huddersfield Town - Cardiff City",Desc="[Lịch
thi đấu] - Huddersfield Town - Cardiff City"},
{Id="rtlNhfrg2k-jNABkkKsUAg",Name="[Lịch thi đấu] - Arsenal - West Ham United",Desc="[Lịch
thi đấu] - Arsenal - West Ham United"}{code}
I want to search to get data that keyword has no accent, ex: "Lich thi dau". Currently, I
can't search to get data when use this case.

Can you guide me, how to search with keyword has no accent to get data that indexed.

I found a way that customize analyzer, but I can't find a sample that can run in .net.

  was:
I made the index data that is the vietnamese characters:
{code:java}
{Id="zgRpBGVKTEqBWkAEDkApig",Name="[Bình chọn 2 câu hỏi] - Mesut Ozil & Jack Ruddy",Desc="[Bình
chọn 2 câu hỏi] - Mesut Ozil & Jack Ruddy"},
{Id="T8t5PKAng0aDwNPBTKR-Hw",Name="[Bình chọn 2 câu hỏi] - Ben Hamer",Desc="[Bình chọn
2 câu hỏi] - Ben Hamer"},
{Id="84o_eBV9cU-OIUeC8RpTaQ",Name="[Top 5 trận đấu gần đây nhất] - Fulham - Burnley",Desc="[Top
5 trận đấu gần đây nhất] - Fulham - Burnley"},
{Id="Ch34ACvS2km-gIhHtHWcaA",Name="[Lịch thi đấu] - Huddersfield Town - Cardiff City",Desc="[Lịch
thi đấu] - Huddersfield Town - Cardiff City"},
{Id="rtlNhfrg2k-jNABkkKsUAg",Name="[Lịch thi đấu] - Arsenal - West Ham United",Desc="[Lịch
thi đấu] - Arsenal - West Ham United"}{code}
I want to search to get data that keyword has no accent, ex: "Lich thi dau". Currently, I
can't search to get data when use this case.

Can you guide me, how to search with keyword has no accent to get data that indexed.

I found a way that customize analyzer, but I can't find a sample.


> Search text without accent to get text has accent
> -------------------------------------------------
>
>                 Key: LUCENENET-604
>                 URL: https://issues.apache.org/jira/browse/LUCENENET-604
>             Project: Lucene.Net
>          Issue Type: Wish
>          Components: Lucene.Net Demo
>    Affects Versions: Lucene.Net 4.8.0
>            Reporter: huân bùi
>            Priority: Minor
>             Fix For: Lucene.Net 4.8.0
>
>
> I made the index data that is the vietnamese characters:
> {code:java}
> {Id="zgRpBGVKTEqBWkAEDkApig",Name="[Bình chọn 2 câu hỏi] - Mesut Ozil & Jack
Ruddy",Desc="[Bình chọn 2 câu hỏi] - Mesut Ozil & Jack Ruddy"},
> {Id="T8t5PKAng0aDwNPBTKR-Hw",Name="[Bình chọn 2 câu hỏi] - Ben Hamer",Desc="[Bình
chọn 2 câu hỏi] - Ben Hamer"},
> {Id="84o_eBV9cU-OIUeC8RpTaQ",Name="[Top 5 trận đấu gần đây nhất] - Fulham
- Burnley",Desc="[Top 5 trận đấu gần đây nhất] - Fulham - Burnley"},
> {Id="Ch34ACvS2km-gIhHtHWcaA",Name="[Lịch thi đấu] - Huddersfield Town - Cardiff
City",Desc="[Lịch thi đấu] - Huddersfield Town - Cardiff City"},
> {Id="rtlNhfrg2k-jNABkkKsUAg",Name="[Lịch thi đấu] - Arsenal - West Ham United",Desc="[Lịch
thi đấu] - Arsenal - West Ham United"}{code}
> I want to search to get data that keyword has no accent, ex: "Lich thi dau". Currently,
I can't search to get data when use this case.
> Can you guide me, how to search with keyword has no accent to get data that indexed.
> I found a way that customize analyzer, but I can't find a sample that can run in .net.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message