How does Lucene/Solr achieve high performance in multi-field / faceted search?

Views: 89 | Published: 2020/5/4 7:33:54 | Tags: lucene, internals, faceted-search


Problem description

Context

This question is mainly about Lucene (or possibly Solr) internals. The main topic is faceted search, in which a search can run along multiple independent dimensions (facets) of objects, for example the size, speed, and price of a car.

When faceted search is implemented with a relational database, multi-field indices are not useful for a large number of facets: facets can be searched in any order, so any one specific ordered multi-field index has a low chance of being used, and creating indices for all possible orderings is infeasible.
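To put a rough number on "all possible orderings", here is a minimal sketch that only counts orderings. The function names are made up for illustration, and the second count is an upper bound rather than what a real schema would need, since a composite index can typically also serve queries on its leading columns.

```python
import math


def full_orderings(num_facets):
    """Number of ways to order all facets in one composite index (n!)."""
    return math.factorial(num_facets)


def ordered_selections(num_facets):
    """Number of ordered choices of 1..n distinct facets (sum of n!/(n-k)!).

    math.perm requires Python 3.8 or newer.
    """
    return sum(math.perm(num_facets, k) for k in range(1, num_facets + 1))


if __name__ == "__main__":
    for n in (3, 5, 10):
        print(n, full_orderings(n), ordered_selections(n))
```

Even at ten facets the count of full orderings is already in the millions, which is the growth the paragraph above refers to.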

Solr is advertised as coping well with faceted search. If I understand correctly, this must be connected to Lucene (supposedly) performing well on multi-field queries, where the fields of a document correspond to the facets of an object.

Question

Lucene's inverted index could be stored in a relational database, and taking the intersection of the matching documents can likewise be trivially achieved in an RDBMS using single-field indices.
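For concreteness, the "intersection of matching documents" described here can be sketched as follows. This is a toy in-memory model, not Lucene's actual implementation: the `cars` data, the function names, and the choice to intersect the shortest posting list first are all assumptions made for illustration.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each (field, value) pair to a sorted list of document ids."""
    index = defaultdict(list)
    for doc_id in sorted(docs):
        for field, value in docs[doc_id].items():
            index[(field, value)].append(doc_id)
    return index


def intersect(a, b):
    """Two-pointer intersection of two sorted posting lists."""
    i = j = 0
    out = []
    while i < len(a) and j < len(b):
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1
        else:
            j += 1
    return out


def search(index, **criteria):
    """Return ids of documents matching every field=value criterion."""
    # Start from the shortest posting list so intermediate results stay small.
    postings = sorted((index.get(item, []) for item in criteria.items()), key=len)
    if not postings:
        return []
    result = postings[0]
    for plist in postings[1:]:
        result = intersect(result, plist)
    return result


# Hypothetical documents: each field plays the role of one facet.
cars = {
    1: {"size": "small", "speed": "fast", "price": "high"},
    2: {"size": "small", "speed": "slow", "price": "low"},
    3: {"size": "large", "speed": "fast", "price": "high"},
    4: {"size": "small", "speed": "fast", "price": "low"},
}
index = build_inverted_index(cars)
```

In this model the order in which facets are given does not matter, because each facet has its own posting list and the lists are merged at query time. For example, `search(index, size="small", speed="fast")` and `search(index, speed="fast", size="small")` both return `[1, 4]`.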

Therefore, Lucene presumably has some advanced technique for multi-field queries beyond just intersecting the sets of matching documents obtained from the inverted index.

So the question is: what is this technique or trick? More broadly, why can Lucene/Solr theoretically achieve better faceted search performance than an RDBMS could (if it can)?

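Note: My first guess was that Lucene uses some kind of space-partitioning method to divide the vector space whose dimensions are built from the document fields, but as far as I know Lucene is not purely vector-space based.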
