是否有匈牙利语的第三方SQL Server分词器这样的事情? [英] Is there such a thing as third party SQL Server word breaker for Hungarian language?
问题描述
我想在全文索引上使用 CONTAINS
,并在匈牙利数据上使用 FORMSOF(...)
。
是否有可能?我知道它在SQL Server中是默认不支持的。
I want to use CONTAINS
on a fulltext index and use FORMSOF(...)
on Hungarian data.
Is it possible? I KNOW it is not supported by default in SQL Server.
推荐答案
SQL Server可以加载自定义的断字符和词干分析器, href =http://msdn.microsoft.com/en-us/library/ms142509.aspx =nofollow>破碎文字和Stemmers 。如果您找不到匈牙利语的词干,那么总有可能创建一个自己的词: Word Breaker和Stemmer Sample ,另请参阅让LRSAMPLE自定义分词器在64位SQL Server 2008上工作一>。你不必自己实现字典,你可以简单地重用例如 libstemmer匈牙利雪球算法,并将其打包为SQL Server stemmer。
SQL Server can load custom word breakers and stemmers, see Word Breakers and Stemmers. If you cannot find a Hungarian stemmer there is always the possibility of creating one your own: Word Breaker and Stemmer Sample, see also Getting the LRSAMPLE custom word-breaker to work on 64-bit SQL Server 2008. You don't have to implement the dictionary yourself, you could simply reuse for instance the libstemmer Hungarian Snowball algorithm and package it as a SQL Server stemmer.
这篇关于是否有匈牙利语的第三方SQL Server分词器这样的事情?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!