How to Calculate cosine similarity with tf-idf using Lucene and Java

Views: 140 · Posted: 2020/5/4 7:42:02 · Tags: java lucene tf-idf cosine-similarity


Problem description

I have a query and a set of documents, and I need to rank the documents by their tf-idf cosine similarity to the query. What support does Lucene offer for this computation? Specifically, which parameters can I obtain directly from Lucene (is there a method that returns tf and idf?), and how can I compute cosine similarity with Lucene (is there a function that returns the cosine similarity directly if I pass it the query vector and the document vector?)

Thanks in advance.
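To make the quantities in the question concrete, here is a minimal sketch of tf-idf cosine ranking in plain Java, with no Lucene dependency. Everything in it is an assumption made for illustration: the class name `TfIdfCosine`, the pre-tokenized toy documents, and the smoothed idf variant `ln((N + 1) / (df + 1)) + 1`, which is only one of several common definitions. With a Lucene index, the document frequency and document count would normally be read from the index (for example via `IndexReader.docFreq(Term)` and `IndexReader.numDocs()`) rather than recomputed as they are here.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;

public class TfIdfCosine {

    /** Raw term frequency: how often each term occurs in one tokenized text. */
    public static Map<String, Integer> termFrequencies(List<String> tokens) {
        Map<String, Integer> tf = new HashMap<>();
        for (String t : tokens) {
            tf.merge(t, 1, Integer::sum);
        }
        return tf;
    }

    /** Document frequency: the number of documents that contain each term. */
    public static Map<String, Integer> documentFrequencies(List<List<String>> docs) {
        Map<String, Integer> df = new HashMap<>();
        for (List<String> doc : docs) {
            // A set ensures each term is counted at most once per document.
            for (String t : new HashSet<>(doc)) {
                df.merge(t, 1, Integer::sum);
            }
        }
        return df;
    }

    /** Sparse tf-idf vector, using the assumed smoothed idf described above. */
    public static Map<String, Double> tfIdf(List<String> tokens,
                                            Map<String, Integer> df,
                                            int numDocs) {
        Map<String, Double> weights = new HashMap<>();
        for (Map.Entry<String, Integer> e : termFrequencies(tokens).entrySet()) {
            int docFreq = df.getOrDefault(e.getKey(), 0);
            double idf = Math.log((numDocs + 1.0) / (docFreq + 1.0)) + 1.0;
            weights.put(e.getKey(), e.getValue() * idf);
        }
        return weights;
    }

    /** Euclidean length of a sparse vector. */
    public static double norm(Map<String, Double> v) {
        double sum = 0.0;
        for (double x : v.values()) {
            sum += x * x;
        }
        return Math.sqrt(sum);
    }

    /** Cosine similarity of two sparse vectors; 0 if either vector is empty. */
    public static double cosine(Map<String, Double> a, Map<String, Double> b) {
        double dot = 0.0;
        for (Map.Entry<String, Double> e : a.entrySet()) {
            Double other = b.get(e.getKey());
            if (other != null) {
                dot += e.getValue() * other;
            }
        }
        double na = norm(a);
        double nb = norm(b);
        return (na == 0.0 || nb == 0.0) ? 0.0 : dot / (na * nb);
    }

    public static void main(String[] args) {
        // Toy corpus, already tokenized and lowercased (hypothetical data).
        List<List<String>> docs = Arrays.asList(
            Arrays.asList("lucene", "is", "a", "search", "library"),
            Arrays.asList("cosine", "similarity", "with", "tf", "idf"),
            Arrays.asList("java", "search", "with", "lucene"));
        List<String> query = Arrays.asList("lucene", "search");

        Map<String, Integer> df = documentFrequencies(docs);
        Map<String, Double> queryVector = tfIdf(query, df, docs.size());
        for (int i = 0; i < docs.size(); i++) {
            double score = cosine(queryVector, tfIdf(docs.get(i), df, docs.size()));
            System.out.printf("doc %d: %.4f%n", i, score);
        }
    }
}
```

Sorting the documents by the printed score in descending order gives the ranking. The second toy document shares no term with the query, so its score is 0. Sparse maps are used instead of dense arrays because most terms in a vocabulary do not occur in any given document.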
