获取与Solr / Lucene中匹配内容关联的元数据 [英] Obtain metadata associated with matched content in Solr/Lucene

查看：127 发布时间：2018/8/2 15:52:54 solr lucene indexing metadata

本文介绍了获取与Solr / Lucene中匹配内容关联的元数据的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一大堆文本文档，我将使用Solr进行索引，其格式是每行文本都有关联的元数据。例如：

I've a large set of text documents which I will index with Solr, in a format where each line of text has associated metadata. For example:

#metadata1
A line of text.
#metadata2
Another long, broken line of
#metadata3
text that should be searchable.

我想对此进行索引以便内容可搜索，包括跨越多行的词组匹配，但不是元数据。但是，我不能丢弃元数据：我希望任何匹配仍然具有相关的元数据。

I'd like to index this such that the content is searchable, including phrase matches spanning multiple lines, but not the metadata. However, I can't discard the metadata: I would like to have any matches still have the associated metadata.

例如。对文本行的查询将返回2个匹配，一个是第一行（及其关联的元数据metadata1），另一个是第二行和第三行（分别具有关联的metadata1和metadata2）。

E.g. A query for "line of text" would return 2 matches, one being the first line (and its associated metadata "metadata1") and the other being the second and third lines (with the associated "metadata1" and "metadata2" respectively).

有谁可以描述如何做到这一点，或者参考一个可以让我开始的教程？

Can anyone describe how this might be done, or reference a tutorial that would get me started?

获取与Solr / Lucene中匹配内容关联的元数据 [英] Obtain metadata associated with matched content in Solr/Lucene

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

获取与Solr / Lucene中匹配内容关联的元数据 [英] Obtain metadata associated with matched content in Solr/Lucene

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭