How to do fuzzy string search without a heavy database?

Views: 123 | Posted: 2017/3/17 20:43:17 | Tags: python database full-text-search fuzzy-search

I have a mapping of catalog numbers to product names:

35  cozy comforter
35  warm blanket
67  pillow

and need a search that will find misspellings such as "cmfrter".

We have code using edit-distance (difflib), but it probably won't scale to the 18000 names.
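For reference, a minimal sketch of what a difflib-based lookup might look like on the sample data. The function name `fuzzy_lookup`, the per-word index, and the `cutoff` value are assumptions made for illustration, not the code from the question. It compares the query against every distinct word, which is the linear scan that may become slow as the catalog grows.

```python
import difflib

# Sample data from the question: (catalog number, product name) pairs.
CATALOG = [
    (35, "cozy comforter"),
    (35, "warm blanket"),
    (67, "pillow"),
]

def fuzzy_lookup(query, catalog=CATALOG, n=3, cutoff=0.5):
    """Return (number, name, score) for names containing a word close to `query`.

    Hypothetical helper: the cutoff of 0.5 is an arbitrary starting point.
    """
    # Index each word separately so one misspelled word can still match.
    words = {}
    for number, name in catalog:
        for word in name.split():
            words.setdefault(word, []).append((number, name))

    results = []
    # get_close_matches scores the query against every candidate word.
    for word in difflib.get_close_matches(query, list(words), n=n, cutoff=cutoff):
        score = difflib.SequenceMatcher(None, query, word).ratio()
        for number, name in words[word]:
            results.append((number, name, score))
    return results

print(fuzzy_lookup("cmfrter"))
```

On the three sample rows, the query "cmfrter" should match only "cozy comforter" (catalog number 35).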

I achieved something similar with Lucene, but since PyLucene only wraps Java, that would complicate deployment to end users.

SQLite doesn't usually have full-text or scoring compiled in.
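Whether a given Python build's SQLite includes a full-text module can be probed at runtime. The sketch below is one way to do it, under the assumption that trying to create a virtual table in an in-memory database is an acceptable test. The helper name `sqlite_has_fts` is hypothetical; `fts3`, `fts4`, and `fts5` are SQLite's full-text extension modules, and which of them are present depends on how the library was compiled.

```python
import sqlite3

def sqlite_has_fts(module="fts5"):
    """Return True if this SQLite build can create a virtual table using `module`."""
    conn = sqlite3.connect(":memory:")
    try:
        # Fails with "no such module" when the extension is not compiled in.
        conn.execute(f"CREATE VIRTUAL TABLE probe USING {module}(name)")
        return True
    except sqlite3.OperationalError:
        return False
    finally:
        conn.close()

for module in ("fts3", "fts4", "fts5"):
    print(module, sqlite_has_fts(module))
```

Note that full-text matching alone is token-based, so a build that has it still needs extra work to catch misspellings.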

The Xapian bindings are like C++ and have some learning curve.

Whoosh is not yet well-documented but includes an abusable spell-checker.

What else is there?