FORMSOF Thesaurus in SQL Server

Problem description

Has anyone measured the performance of this, in terms of speed, when there is a high number of substitutes for any given word? For instance, I want to use it to store common misspellings, and I expect to have 4-10 variations of a word.

<expansion>
   <sub>administration</sub>
   <sub>administraton</sub>
   <sub>aministraton</sub>
</expansion>

When you run a full-text search, how does performance degrade with that number of variations? For instance, I assume it has to do a separate full-text search for each variation and combine them with an OR?

Also, does having, say, 20-30K entries in the thesaurus XML file impact performance?

Solution

No, but performance results are very often quite system-specific. Your best bet is to put together some sample data and run your own test cases.
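
One way to put that sample data together is to generate a throwaway thesaurus file of the size in question. The sketch below is only an illustration, not a tested benchmark: the helper names (`synthetic_words`, `misspellings`, `build_thesaurus`, `write_thesaurus`) are made up for this example, the "misspellings" are simply the word with one letter dropped, and the wrapper elements follow the layout of the stock SQL Server thesaurus files as I understand it.

```python
import random
import string
from xml.sax.saxutils import escape


def synthetic_words(count, rng, length=12):
    """Return `count` distinct random lowercase pseudo-words (stand-ins for real terms)."""
    words = set()
    while len(words) < count:
        words.add("".join(rng.choice(string.ascii_lowercase) for _ in range(length)))
    return sorted(words)


def misspellings(word, count, rng):
    """Return up to `count` distinct variants of `word`, each missing one letter."""
    positions = list(range(1, len(word)))  # never drop the first letter
    rng.shuffle(positions)
    variants = []
    for pos in positions:
        variant = word[:pos] + word[pos + 1:]
        if variant not in variants:
            variants.append(variant)
        if len(variants) == count:
            break
    return variants


def build_thesaurus(words, rng, min_variants=4, max_variants=10):
    """Build thesaurus XML text with one <expansion> set per word."""
    # Assumption: this wrapper mirrors the layout of the default thesaurus files.
    lines = [
        '<XML ID="Microsoft Search Thesaurus">',
        '  <thesaurus xmlns="x-schema:tsSchema.xml">',
        "    <diacritics_sensitive>0</diacritics_sensitive>",
    ]
    for word in words:
        lines.append("    <expansion>")
        lines.append(f"      <sub>{escape(word)}</sub>")
        wanted = rng.randint(min_variants, max_variants)
        for variant in misspellings(word, wanted, rng):
            lines.append(f"      <sub>{escape(variant)}</sub>")
        lines.append("    </expansion>")
    lines += ["  </thesaurus>", "</XML>"]
    return "\n".join(lines)


def write_thesaurus(path, xml_text):
    """Write the file as UTF-16; Python's "utf-16" codec emits a byte order mark."""
    with open(path, "w", encoding="utf-16") as handle:
        handle.write(xml_text)


# Example (hypothetical file name), roughly the 20K-entry case from the question:
# rng = random.Random(42)
# write_thesaurus("ts_test.xml", build_thesaurus(synthetic_words(20000, rng), rng))
```

After copying the generated content into the thesaurus file for your language and reloading it (on SQL Server 2008 and later, `sys.sp_fulltext_load_thesaurus_file` with the language's LCID should do this), you can time a query such as `CONTAINS(Body, 'FORMSOF(THESAURUS, administration)')` against a plain `CONTAINS` on the same term, where `Body` is a placeholder for your own full-text indexed column. Repeating the run with different values of `min_variants`, `max_variants`, and the word count should show how the numbers move on your system.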
