Fuzzy string search library in Java
Question
I'm looking for a high-performance Java library for fuzzy string search.
There are numerous algorithms for finding similar strings: Levenshtein distance, Daitch-Mokotoff Soundex, n-grams, etc.
What Java implementations exist, and what are their pros and cons? I'm aware of Lucene; are there other solutions, or is Lucene the best?
I found these; does anyone have experience with them?
Recommended answer
Commons Lang has an implementation of Levenshtein distance.
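For readers who want to see what such a method computes, here is a minimal plain-Java sketch of Levenshtein distance using the standard two-row dynamic programming table. It is not the Commons Lang source; the class name LevenshteinSketch and the method name distance are made up for illustration. In Commons Lang 3 the library call is StringUtils.getLevenshteinDistance, which should give the same numbers for these inputs.

```java
// Illustrative sketch only: class and method names are hypothetical,
// not part of Commons Lang.
final class LevenshteinSketch {

    // Returns the minimum number of single-character insertions,
    // deletions, and substitutions needed to turn a into b.
    static int distance(CharSequence a, CharSequence b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];

        // Distance from the empty prefix of a to each prefix of b.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }

        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // deleting i characters of a yields the empty string
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insert = curr[j - 1] + 1;
                int delete = prev[j] + 1;
                int substitute = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insert, delete), substitute);
            }
            // Reuse the two rows instead of allocating a full matrix.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        System.out.println(distance("kitten", "sitting")); // prints 3
    }
}
```

For fuzzy search, a common pattern is to accept a candidate when the distance falls below a small threshold relative to the query length. Scanning a large dictionary this way is linear in the number of candidates, which is one reason an index-based tool such as Lucene may still be preferable for big data sets.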
Commons Codec has implementations of Soundex and Metaphone.
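To illustrate what a phonetic encoder does, here is a rough plain-Java sketch of classic American Soundex. It is a simplified illustration, not the Commons Codec implementation; the class name SoundexSketch and the method name encode are hypothetical. In Commons Codec the corresponding classes are org.apache.commons.codec.language.Soundex and Metaphone.

```java
// Illustrative sketch only: class and method names are hypothetical,
// not part of Commons Codec.
final class SoundexSketch {

    // Soundex digit for each letter A..Z; '0' marks letters that are not coded
    // (vowels, H, W, Y).
    private static final String CODES = "01230120022455012623010202";

    // Encodes a word as one letter followed by three digits, e.g. "Robert" -> "R163".
    // Non-ASCII letters and other characters are skipped in this sketch.
    static String encode(String word) {
        StringBuilder out = new StringBuilder(4);
        char lastCode = '0';

        for (int i = 0; i < word.length() && out.length() < 4; i++) {
            char c = Character.toUpperCase(word.charAt(i));
            if (c < 'A' || c > 'Z') {
                continue;
            }
            char code = CODES.charAt(c - 'A');

            if (out.length() == 0) {
                // The first letter is kept as-is, but its code still
                // suppresses an immediately following duplicate.
                out.append(c);
                lastCode = code;
            } else if (c == 'H' || c == 'W') {
                // H and W are ignored and do not separate duplicate codes.
                continue;
            } else {
                if (code != '0' && code != lastCode) {
                    out.append(code);
                }
                // Vowels reset lastCode, so the same code may repeat after them.
                lastCode = code;
            }
        }

        if (out.length() == 0) {
            return "";
        }
        while (out.length() < 4) {
            out.append('0'); // pad short codes with zeros
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(encode("Robert")); // prints R163
        System.out.println(encode("Rupert")); // prints R163
    }
}
```

Two names match phonetically when their codes are equal, so this kind of encoding suits lookups by precomputed key (for example, a map from code to names) rather than ranking by similarity.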