Phonetic filter factory for Hindi


Question

I am working with Apache Solr and trying to use the phonetic filter factory. I have tried all the encoders available with solr.PhoneticFilterFactory, but none of them supports Indian languages. Is there any other filter or method that would give me a phonetic representation for Indian languages such as Hindi, Tamil, and Bengali?

If not, how can we modify the existing filters to support these languages?

Recommended answer

Have you tried the new Beider Morse Filter Factory, which was just added in Solr 3.6 and is (alas) not yet well documented?

https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.BeiderMorseFilterFactory

It was developed for phonetic searching of Central and Eastern European surnames, but it may work for other languages too. I have personally found that it works much better than Soundex and the other older sound-alike methods.
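As a rough illustration, a field type using this filter could be declared in the schema along the following lines. This is a minimal sketch, not a tested configuration: the field type name text_bm_phonetic is made up for the example, and the attribute values are only one plausible starting point. As far as I can tell, the bundled Beider-Morse rule sets do not cover Indic scripts, so whether this produces useful output for Hindi, Tamil, or Bengali text would need to be checked on real data.

```xml
<!-- Hypothetical field type name; adjust to your own schema. -->
<fieldType name="text_bm_phonetic" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- nameType: GENERIC, ASHKENAZI, or SEPHARDIC; ruleType: APPROX or EXACT -->
    <!-- languageSet="auto" lets the filter guess the language of each token -->
    <filter class="solr.BeiderMorseFilterFactory"
            nameType="GENERIC" ruleType="APPROX"
            concat="true" languageSet="auto"/>
  </analyzer>
</fieldType>
```

Fields of this type would then be matched on their phonetic encodings rather than on the literal tokens, so the index needs to be rebuilt after changing the analyzer.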
