FORMSOF Thesaurus in SQL Server
Has anyone measured the performance of this, in terms of speed, when there is a high number of substitutes for any given word? For instance, I want to use it to store common misspellings, and I expect to have 4-10 variations of each word.
<expansion>
<sub>administration</sub>
<sub>administraton</sub>
<sub>aministraton</sub>
</expansion>
When you run a full-text search, how does performance degrade with that number of variations? For instance, I assume it has to run a separate full-text search for each variation, combined with an OR?
Also, does having, say, 20-30K entries in the thesaurus XML file affect performance?
No, but performance testing is very often quite system-specific. Your best bet is to put together some sample data and run your own test cases.
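If it helps with putting that sample data together, here is a rough Python sketch that generates a large thesaurus file to test against. It is only an illustration, not something from the original answer: the `word0`, `word0x1`, ... terms are made-up filler, the function names are hypothetical, and the `<XML>`/`<thesaurus>` wrapper and the UTF-16 encoding follow the thesaurus file layout as I understand it from the SQL Server documentation, so check them against the files shipped with your own instance.

```python
from xml.sax.saxutils import escape


def build_thesaurus(expansion_sets, diacritics_sensitive=False):
    """Return thesaurus XML text with one <expansion> per list of variants."""
    lines = [
        '<XML ID="Microsoft Search Thesaurus">',
        '  <thesaurus xmlns="x-schema:tsSchema.xml">',
        f"    <diacritics_sensitive>{int(diacritics_sensitive)}</diacritics_sensitive>",
    ]
    for variants in expansion_sets:
        lines.append("    <expansion>")
        for term in variants:
            # Escape &, < and > so arbitrary terms keep the file well-formed.
            lines.append(f"      <sub>{escape(term)}</sub>")
        lines.append("    </expansion>")
    lines += ["  </thesaurus>", "</XML>"]
    return "\n".join(lines) + "\n"


def synthetic_sets(n_sets, n_variants):
    """Make n_sets filler expansion sets of n_variants terms each (test data only)."""
    return [
        [f"word{i}"] + [f"word{i}x{j}" for j in range(1, n_variants)]
        for i in range(n_sets)
    ]


def write_thesaurus(path, xml_text):
    """Write the file as UTF-16; Python's utf-16 codec adds the byte order mark."""
    with open(path, "w", encoding="utf-16") as handle:
        handle.write(xml_text)


# One real set from the question plus 20,000 filler sets of 6 terms each.
sets = [["administration", "administraton", "aministraton"]] + synthetic_sets(20000, 6)
xml_text = build_thesaurus(sets)
```

Calling `write_thesaurus("tsenu_test.xml", xml_text)` (file name is just a placeholder) gives you a file you can copy over the thesaurus file for your language on a test instance and reload, for example with `sys.sp_fulltext_load_thesaurus_file`. You can then time the same `FORMSOF(THESAURUS, ...)` query with and without the large file, and vary the second argument of `synthetic_sets` to see how the number of variations per word affects it.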