最好的文本搜索引擎与自定义Web应用程序集成? [英] Best text search engine for integrating with custom web app?

查看:115
本文介绍了最好的文本搜索引擎与自定义Web应用程序集成?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我们有一个网络应用程序,允许用户上传文件,创建自己的文件等等。上传的文件存储在Amazon S3上,创建的信息存储在MySQL数据库中。我在寻找的是某种搜索引擎,在那里我提供了我们所有的文本文档,每个文档都有一个唯一的ID,并且它构建了一个索引或其他内容。稍后,我可以为它提供搜索查询,并且它会将最匹配的文档(通过它们的ID)以及匹配文本的片段一起提取出来。

基本上,我们希望允许用户搜索他们上传的内容以及其他用户标记为公开的内容。该解决方案应该在标准的Linux服务器上运行,理想情况下它将是开源的,但如果价格不昂贵,我还会考虑付费解决方案。



迄今为止,我发现了三个潜在的候选人:


  1. MySQL全文搜索 - 我读过的一些报告显示它很慢

  2. < apache href =http://lucene.apache.org/java/docs/index.html =nofollow noreferrer> Apache Lucene - 不幸的是用Java写的,但如果我有至。据称速度很快

  3. 狮身人面像 - 似乎不像流行,理想情况下,无论我找到哪种解决方案都会得到很多社区支持。 如果你有以上任何经验的话。

    Solr 。它基于Lucene,速度非常快,从任何平台使用都非常简单。


    We have a web app that allows users to upload documents, create their own documents, and so on. Uploaded files are stored on Amazon S3, created information is stored in a MySQL database. What I'm looking for is some sort of search engine, where I feed it all of our text documents, each with a unique ID, and it builds an index or whatever. Later, I can give it search queries, and it will pull out the best matching documents (via their ID), along with snippets of matching text.

    Basically we want to allow our users to search through their repository of uploaded stuffs, along with anything that other users have marked as public. The solution should run on a standard Linux server, and ideally it would be open source, but I'll also consider paid solutions if they aren't outrageously priced.

    So far, I've found three potential candidates:

    1. MySQL Full Text Search - some reports I've read are that it's very slow
    2. Apache Lucene - unfortunately written in Java, but I'll use it if I have to. Supposedly fast
    3. Sphinx - doesn't seem to be as popular, ideally whatever solution I find will have lots of community support.

    Please let me know if there are any other good choices that I've overlooked, or if you have experience with any of the above.

    解决方案

    Take a look at Solr. It's based on Lucene, so it's very fast, and it's really easy to use from any platform.

    这篇关于最好的文本搜索引擎与自定义Web应用程序集成?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆