From dev-return-10759-apmail-lucenenet-dev-archive=lucenenet.apache.org@lucenenet.apache.org Fri Aug 31 11:06:03 2018 Return-Path: X-Original-To: apmail-lucenenet-dev-archive@www.apache.org Delivered-To: apmail-lucenenet-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 114B622C40 for ; Fri, 31 Aug 2018 11:06:03 +0000 (UTC) Received: (qmail 21154 invoked by uid 500); 31 Aug 2018 11:06:02 -0000 Delivered-To: apmail-lucenenet-dev-archive@lucenenet.apache.org Received: (qmail 21110 invoked by uid 500); 31 Aug 2018 11:06:02 -0000 Mailing-List: contact dev-help@lucenenet.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucenenet.apache.org Delivered-To: mailing list dev@lucenenet.apache.org Received: (qmail 21099 invoked by uid 99); 31 Aug 2018 11:06:02 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 31 Aug 2018 11:06:02 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 53C2518068D for ; Fri, 31 Aug 2018 11:06:02 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -110.3 X-Spam-Level: X-Spam-Status: No, score=-110.3 tagged_above=-999 required=6.31 tests=[ENV_AND_HDR_SPF_MATCH=-0.5, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, USER_IN_DEF_SPF_WL=-7.5, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id eaWv2UFNvdAb for ; Fri, 31 Aug 2018 11:06:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 047285F382 for ; Fri, 31 Aug 2018 11:06:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 84EEAE0E1D for ; Fri, 31 Aug 2018 11:06:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 340CD23F98 for ; Fri, 31 Aug 2018 11:06:00 +0000 (UTC) Date: Fri, 31 Aug 2018 11:06:00 +0000 (UTC) From: =?utf-8?Q?hu=C3=A2n_b=C3=B9i_=28JIRA=29?= To: dev@lucenenet.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (LUCENENET-604) Search text without accent to get text has accent MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/LUCENENET-604?page=3Dcom.atlas= sian.jira.plugin.system.issuetabpanels:all-tabpanel ] hu=C3=A2n b=C3=B9i updated LUCENENET-604: ------------------------------- Description:=20 I made the index data that is the vietnamese characters: {code:java} {Id=3D"zgRpBGVKTEqBWkAEDkApig",Name=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u h= =E1=BB=8Fi] - Mesut Ozil & Jack Ruddy",Desc=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c= =C3=A2u h=E1=BB=8Fi] - Mesut Ozil & Jack Ruddy"}, {Id=3D"T8t5PKAng0aDwNPBTKR-Hw",Name=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u h= =E1=BB=8Fi] - Ben Hamer",Desc=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u h=E1=BB= =8Fi] - Ben Hamer"}, {Id=3D"84o_eBV9cU-OIUeC8RpTaQ",Name=3D"[Top 5 tr=E1=BA=ADn =C4=91=E1=BA=A5u= g=E1=BA=A7n =C4=91=C3=A2y nh=E1=BA=A5t] - Fulham - Burnley",Desc=3D"[Top 5= tr=E1=BA=ADn =C4=91=E1=BA=A5u g=E1=BA=A7n =C4=91=C3=A2y nh=E1=BA=A5t] - Fu= lham - Burnley"}, {Id=3D"Ch34ACvS2km-gIhHtHWcaA",Name=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u] = - Huddersfield Town - Cardiff City",Desc=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA= =A5u] - Huddersfield Town - Cardiff City"}, {Id=3D"rtlNhfrg2k-jNABkkKsUAg",Name=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u] = - Arsenal - West Ham United",Desc=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u] - = Arsenal - West Ham United"}{code} I want to search to get data that keyword has no accent, ex: "Lich thi dau"= . Currently, I can't search to get data when use this case. Can you guide me, how to search with keyword has no accent to get data that= indexed. I found a way that customize analyzer, but I can't find a sample that can r= un in .net. was: I made the index data that is the vietnamese characters: {code:java} {Id=3D"zgRpBGVKTEqBWkAEDkApig",Name=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u h= =E1=BB=8Fi] - Mesut Ozil & Jack Ruddy",Desc=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c= =C3=A2u h=E1=BB=8Fi] - Mesut Ozil & Jack Ruddy"}, {Id=3D"T8t5PKAng0aDwNPBTKR-Hw",Name=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u h= =E1=BB=8Fi] - Ben Hamer",Desc=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u h=E1=BB= =8Fi] - Ben Hamer"}, {Id=3D"84o_eBV9cU-OIUeC8RpTaQ",Name=3D"[Top 5 tr=E1=BA=ADn =C4=91=E1=BA=A5u= g=E1=BA=A7n =C4=91=C3=A2y nh=E1=BA=A5t] - Fulham - Burnley",Desc=3D"[Top 5= tr=E1=BA=ADn =C4=91=E1=BA=A5u g=E1=BA=A7n =C4=91=C3=A2y nh=E1=BA=A5t] - Fu= lham - Burnley"}, {Id=3D"Ch34ACvS2km-gIhHtHWcaA",Name=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u] = - Huddersfield Town - Cardiff City",Desc=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA= =A5u] - Huddersfield Town - Cardiff City"}, {Id=3D"rtlNhfrg2k-jNABkkKsUAg",Name=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u] = - Arsenal - West Ham United",Desc=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u] - = Arsenal - West Ham United"}{code} I want to search to get data that keyword has no accent, ex: "Lich thi dau"= . Currently, I can't search to get data when use this case. Can you guide me, how to search with keyword has no accent to get data that= indexed. I found a way that customize analyzer, but I can't find a sample. > Search text without accent to get text has accent > ------------------------------------------------- > > Key: LUCENENET-604 > URL: https://issues.apache.org/jira/browse/LUCENENET-604 > Project: Lucene.Net > Issue Type: Wish > Components: Lucene.Net Demo > Affects Versions: Lucene.Net 4.8.0 > Reporter: hu=C3=A2n b=C3=B9i > Priority: Minor > Fix For: Lucene.Net 4.8.0 > > > I made the index data that is the vietnamese characters: > {code:java} > {Id=3D"zgRpBGVKTEqBWkAEDkApig",Name=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u= h=E1=BB=8Fi] - Mesut Ozil & Jack Ruddy",Desc=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 = c=C3=A2u h=E1=BB=8Fi] - Mesut Ozil & Jack Ruddy"}, > {Id=3D"T8t5PKAng0aDwNPBTKR-Hw",Name=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u= h=E1=BB=8Fi] - Ben Hamer",Desc=3D"[B=C3=ACnh ch=E1=BB=8Dn 2 c=C3=A2u h=E1= =BB=8Fi] - Ben Hamer"}, > {Id=3D"84o_eBV9cU-OIUeC8RpTaQ",Name=3D"[Top 5 tr=E1=BA=ADn =C4=91=E1=BA= =A5u g=E1=BA=A7n =C4=91=C3=A2y nh=E1=BA=A5t] - Fulham - Burnley",Desc=3D"[T= op 5 tr=E1=BA=ADn =C4=91=E1=BA=A5u g=E1=BA=A7n =C4=91=C3=A2y nh=E1=BA=A5t] = - Fulham - Burnley"}, > {Id=3D"Ch34ACvS2km-gIhHtHWcaA",Name=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u= ] - Huddersfield Town - Cardiff City",Desc=3D"[L=E1=BB=8Bch thi =C4=91=E1= =BA=A5u] - Huddersfield Town - Cardiff City"}, > {Id=3D"rtlNhfrg2k-jNABkkKsUAg",Name=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u= ] - Arsenal - West Ham United",Desc=3D"[L=E1=BB=8Bch thi =C4=91=E1=BA=A5u] = - Arsenal - West Ham United"}{code} > I want to search to get data that keyword has no accent, ex: "Lich thi da= u". Currently, I can't search to get data when use this case. > Can you guide me, how to search with keyword has no accent to get data th= at indexed. > I found a way that customize analyzer, but I can't find a sample that can= run in .net. -- This message was sent by Atlassian JIRA (v7.6.3#76005)