如何更改SQL Server全文索引中的分词符 [英] How to change word-break characters in SQL Server Full-Text indexing

查看:173
本文介绍了如何更改SQL Server全文索引中的分词符的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

默认情况下,当告诉SQL Server(当前使用2008)到全文索引列时,它将处理诸如@和。之类的字符。作为工作破坏者,类似于。

我想限制破坏性字符只是,所以joe.bloggs@somewhere .com被视为一个单词。

看起来,人们可以选择一个语言的断字符对索引列 - 也许我需要设置一个自定义语言?



有谁知道我该怎么做? 为了让您的分词器能够与SQL Server一起飞行,您必须禁用签名验证并将您的COM CLSID添加到注册表中。欲了解更多信息看看这篇文章: http://blogs.msdn.com/shajan/default。 aspx
它帮助了我很多!然而,我从来没有设法创建自己的语言,所以我只是劫持了一个已经存在的语言。

By default, when one tells SQL Server (currently using 2008) to Full-Text index a column, it treats characters such as "@" and "." as work-breakers, similarly to " ".

I'd like to restrict the work-breaking characters to just be " ", so that "joe.bloggs@somewhere.com" is treated as a single word.

It appears that one can choose a "Language for Word Breaker" against the indexed column - perhaps I need to set up a custom language?

Does anyone know how I can do this?

解决方案

In order to make your word breaker fly with SQL Server you have to disable signature verification and add your COM CLSID to the registry. For more info check out this post: http://blogs.msdn.com/shajan/default.aspx It helped me a lot! However I never managed to create my own language so I simply hijacked an already existing one.

这篇关于如何更改SQL Server全文索引中的分词符的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆