如何在Lucene中存储多种不同类型的文档 [英] How to store multiple distinct types of documents in Lucene

查看：73 发布时间：2020/5/4 7:39:34 lucene lucene.net

本文介绍了如何在Lucene中存储多种不同类型的文档的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个现有的Lucene商店，其中存储着数百万个文档，每个文档代表一个实体的元数据.我有几个Id字段(Id1，Id2 .... Id5)，每个文档对此字段可以有零个或多个值.一次只能由这些ID之一查询该索引.我已经独立索引了这些字段，并且一切都很好.我最初选择使用Lucene，因为它是迄今为止查询大量小文件的最快方法，我对自己的决定感到满意.

I have an existing Lucene store with many millions of documents, each one representing metadata for an entity. I have a few Id fields (Id1, Id2 .. Id5) and each document can have zero or many values for this field. The index is only ever queried by one of these Ids at a time. I've indexed these fields independently and it's is all working great. I initially chose to use Lucene as it was by far the fastest way to query such a vast number of small documents and I am happy with my decision.

但是，现在我必须存储另一种类型的文档，该文档还代表实体的另一种元数据，并具有(Id1，Id2 .. Id5)的值，并且这些ID也将单独查询它们.现有元数据和这组新数据将彼此独立存储和查询.

However now I must store another type of document which also represent a different kind of metadata for entities and have values for (Id1, Id2 .. Id5), and which also will be queried by one of those Ids separately. The existing metadata and this new set of data will be stored and queried for independently from each other.

如何通过ID来查询Lucene，但仅查询一种类型的文档.我可以考虑一些选择，但是我想从经验中知道那些建议，以便使Lucene易于管理和快速进行.

How do I query Lucene by an Id but for only one type of document. I can think of a few options, but I'd like to know what those in the know recommend from experience in order to keep Lucene manageable and fast.

使用单独的Lucene索引.由于文档类型是正交的，因此可以避免该问题.能够分别从索引进行读取和写入还有一个好处.
将新文档的Id1..Idn字段重命名为XId1 ... XIdn.这样，一种类型的文档将不会具有与另一种类型的文档相同的字段名称.似乎比实际的解决方案更像是一种避免该问题的解决方法.
添加一个数字字段类型"，并将索引更改为(类型，Idx).这种方法似乎很浪费，因为每个索引还必须包含类型.

我可以打破与现有设置的向后兼容性.如果我要添加其他文档类型，则可以重用该解决方案将是很好的选择.

I am able to break backwards compatibility with my existing setup. It would be great if the solution can be reused if I come to add another document type.

如何在Lucene中存储多种不同类型的文档 [英] How to store multiple distinct types of documents in Lucene

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

如何在Lucene中存储多种不同类型的文档 [英] How to store multiple distinct types of documents in Lucene

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭