在分析了Lucene文档字段标记后，我该如何读取它们? [英] How can I read a Lucene document field tokens after they are analyzed?

查看：63 发布时间：2020/5/4 7:31:31 lucene

本文介绍了在分析了Lucene文档字段标记后，我该如何读取它们?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

如果我创建一个文档并添加一个既存储又分析的字段，那么我如何才能将该字段作为令牌列表读回去?我有以下内容:

If I create a document and add a field that is both stored and analyzed, how can I then read this field back as a list of tokens? I have the following:

            Document doc = new Document();
            doc.add(new Field("url", fileName, Store.YES, Index.NOT_ANALYZED));
            doc.add(new Field("text", fileContent, Store.YES, Index.ANALYZED));
            // add the document to the index
            writer.addDocument(doc);

因此，fileContext是一个包含大量文本的字符串.对其进行分析，以便在将其存储在索引中时对其进行标记化.但是，如何获得这些令牌?存储完索引后，我可以从索引中检索该文档，并且可以从文档中读取文本"字段，但这是以字符串形式返回的.如果可能的话，我想获得代币.我的作家"是一个IndexWriter实例，它使用StandardAnalyzer.任何指针都将非常受欢迎.

So the fileContext is a String containing a lot of text. It is analyzed whereby it is tokenized when it is stored in the index. However, how can I get these tokens? I can retrieve the document from the index after it is stored, and I can read the "text" field from the document, but this is returned as a string. I would like to get the tokens if possible. My 'writer' is an IndexWriter instance and it uses a StandardAnalyzer. Any pointers would be very much welcomed.

非常感谢您

在分析了Lucene文档字段标记后，我该如何读取它们? [英] How can I read a Lucene document field tokens after they are analyzed?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

在分析了Lucene文档字段标记后，我该如何读取它们? [英] How can I read a Lucene document field tokens after they are analyzed?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭