提高模糊字符串匹配字典的性能 [英] Improving performance of fuzzy string matching against a dictionary

查看：114 发布时间：2017/4/3 13:44:49 java data-structures

本文介绍了提高模糊字符串匹配字典的性能的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

所以我正在使用 SecondString 进行模糊字符串匹配，其中我有一个大字典进行比较（字典中的每个条目都有相关联的非唯一标识符）。我目前正在使用一个hashMap来存储这个字典。

So I'm currently working for with using SecondString for fuzzy string matching, where I have a large dictionary to compare to (with each entry in the dictionary has an associated non-unique identifier). I am currently using a hashMap to store this dictionary.

当我想做模糊字符串匹配时，我首先检查字符串是否在hashMap中，然后迭代所有其他潜在的键，计算字符串相似度并存储具有最高相似度的k，v对/ s。根据我使用的字典可能需要很长时间（12330 - 1800035条目）。有什么办法加快速度吗？我目前正在写一个记忆功能/表格，作为一种加快速度的方法，但任何人都可以想到一个更好的方式来提高这个速度吗？也许是一个不同的结构或其他我错过的东西。

When I want to do fuzzy string matching, I first check to see if the string is in the hashMap and then I iterate through all of the other potential keys, calculating the string similarity and storing the k,v pair/s with the highest similarity. Depending on which dictionary I am using this can take a long time ( 12330 - 1800035 entries ). Is there any way to speed this up or make it faster? I am currently writing a memoization function/table as a way of speeding this up, but can anyone else think of a better way to improve the speed of this? Maybe a different structure or something else I'm missing.

非常感谢提前，

Nathan

提高模糊字符串匹配字典的性能 [英] Improving performance of fuzzy string matching against a dictionary

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

提高模糊字符串匹配字典的性能 [英] Improving performance of fuzzy string matching against a dictionary

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭