找到最相似的字符串输入最快的方法？ [英] Fastest way to find most similar string to an input?

查看：330 发布时间：2015/11/30 14:26:10 algorithm string language-agnostic

本文介绍了找到最相似的字符串输入最快的方法？的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

由于长度为N的查询串Q，和长度的m序列的列表L确切的N，什么是最有效的算法来查找字符串L中以最少的不匹配位置至Q？例如：

Given a query string Q of length N, and a list L of M sequences of length exactly N, what is the most efficient algorithm to find the string in L with the fewest mismatch positions to Q? For example:

Q = "ABCDEFG";
L = ["ABCCEFG", "AAAAAAA", "TTAGGGT", "ZYXWVUT"];
answer = L.query(Q);  # Returns "ABCCEFG"
answer2 = L.query("AAAATAA");  #Returns "AAAAAAA".

最显而易见的方法是扫描每一个序列中L，使搜索取O（M * N）。有没有办法做到这一点在次线性时间呢？我不在乎，如果有一个大量的前期成本，组织L放入一些数据结构，因为它会被询问了很多次。此外，擅自处理追平比分是好的。

The obvious way is to scan every sequence in L, making the search take O(M * N). Is there any way to do this in sublinear time? I don't care if there's a large upfront cost to organizing L into some data structure because it will be queried a lot of times. Also, handling tied scores arbitrarily is fine.

编辑：为了澄清，我正在寻找的汉明距离

To clarify, I am looking for the Hamming distance.

找到最相似的字符串输入最快的方法？ [英] Fastest way to find most similar string to an input?

问题描述

推荐答案

相关文章

C/C++最新文章

热门教程

热门工具

登录关闭

找到最相似的字符串输入最快的方法？ [英] Fastest way to find most similar string to an input?

问题描述

推荐答案

相关文章

C/C++最新文章

热门教程

热门工具

登录 关闭

登录关闭