Lucene/Solr 如何在多字段/分面搜索中实现高性能? [英] How does Lucene/Solr achieve high performance in multi-field / faceted search?

查看：19 发布时间：2022/1/15 13:15:05 lucene internals faceted-search

本文介绍了Lucene/Solr 如何在多字段/分面搜索中实现高性能?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

上下文

这是一个主要关于 Lucene(或可能是 Solr)内部的问题.主要主题是分面搜索，其中可以沿着对象的多个独立维度(方面)进行搜索(例如汽车的大小、速度、价格).

This is a question mainly about Lucene (or possibly Solr) internals. The main topic is faceted search, in which search can happen along multiple independent dimensions (facets) of objects (for example size, speed, price of a car).

当使用关系数据库实现时，对于大量构面，多字段索引没有用，因为可以按任何顺序搜索构面，因此使用特定有序多索引的机会很小，并创建所有可能的排序指数难以忍受.

When implemented with relational database, for a large number of facets multi-field indices are not useful, since facets can be searched in any order, so a specific ordered multi-index is used with low chance, and creating all possible orderings of indices is unbearable.

Solr 被宣传为可以很好地应对分面搜索任务，如果我认为正确的话，它必须与 Lucene 相关联(据说)在多字段查询(文档的字段与对象的方面相关)上表现良好.

Solr is advertised to cope well with the faceted search task, which if I think correctly has to be connected with Lucene (supposedly) performing well on multi-field queries (where fields of a document relate to facets of an object).

问题

Lucene的倒排索引可以存储在关系型数据库中，自然取匹配文档的交集也可以通过RDBMS使用单字段索引轻松实现.

The inverted index of Lucene can be stored in a relational database, and naturally taking the intersections of the matching documents can also be trivially achieved with RDBMS using single-field indices.

因此，Lucene 应该有一些用于多字段查询的先进技术，而不仅仅是基于倒排索引获取匹配文档的交集.

Therefore, Lucene supposedly has some advanced technique for multi-field queries other than just taking the intersection of matching documents based on the inverted index.

所以问题是，这种技术/技巧是什么?更广泛地说:为什么 Lucene/Solr 在理论上可以实现比 RDBMS 更好的分面搜索性能(如果可以的话)?

So the question is, what is this technique/trick? More broadly: Why can Lucene/Solr achieve better faceted search performance theoretically than RDBMS could (if so)?

注意:我的第一个猜测是 Lucene 会使用一些空间分区方法来分割从文档字段构建的向量空间作为维度，但据我了解，Lucene 并不是纯粹基于向量空间的.

Lucene/Solr 如何在多字段/分面搜索中实现高性能? [英] How does Lucene/Solr achieve high performance in multi-field / faceted search?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

Lucene/Solr 如何在多字段/分面搜索中实现高性能? [英] How does Lucene/Solr achieve high performance in multi-field / faceted search?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭