什么意思是“文档受欢迎度"?在Solr [英] What means "document popularity" in Solr

查看:106
本文介绍了什么意思是“文档受欢迎度"?在Solr的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

什么是 文档受欢迎程度 ?

What is document popularity in solr indexing..?

EDisMax解析器使用boost参数.在这样的示例&boost=popularity中,我注意到了一个查询.我不明白什么是 boost 以及 boost = popularity .在理解boost参数之前,我想知道什么是文档索引中的流行度".

EDisMax parser uses boost parameter. In the example &boost=popularity like that I noticed one query. I couldn't understand what is boost as well as boost=popularity. Before understanding the boost parameter I'd like to know what is "popularity" in document indexing.

推荐答案

popularity只是某些字段",已被用作示例,而boost是为edismax请求处理程序定义的查询参数.提升手段是根据某个字段值(或某些基于字段值的函数的结果)来影响得分(每个搜索命中的相关性).

popularity is just "some field" which has been used as an example while boost is a query parameter defined for the edismax request handler. Boosting means to influence the scoring (the relevance of each search hit) depending on some field value (or result of some function based on field values).

请参见The boost Parameter部分" rel ="nofollow"> https://cwiki.apache.org/confluence/display/solr/The+Extended+DisMax+Query+Parser .

See section The boost Parameter in https://cwiki.apache.org/confluence/display/solr/The+Extended+DisMax+Query+Parser.

如果要在自己的索引中实现类似popularity的内容,则必须:

If you want to implement something like popularity in your own index you would have to:

  1. 向您的架构中添加一个名为popularity的字段,其类型为intfloatExternalFileField(取决于您如何索引和应用它).
  2. 收集搜索结果的统计数据并存储与文档ID相关的统计数据(例如,通过评估访问日志)
  3. 在索引时间内或通过ExternalFileField(或将来通过docValues部分更新)存储从统计数据中获得的人气值.
  4. 通过设置参数boost=popularity(或在函数查询中使用popularity)在查询期间应用提升.
  1. add a field to your schema called popularity with type int or float or ExternalFileField (depends on how you index and apply it).
  2. gather statistics data for your search results and store those in relation to the document IDs (e.g. by evaluating access logs)
  3. during index time or via ExternalFileField (or in the future via docValues partial updates) store the popularity values that you get from your statistics data.
  4. apply the boost during query time by setting the parameter boost=popularity (or using popularity in a function query).

有关提高人气的更多信息:

More on popularity boosting:

https://www.safaribooksonline .com/blog/2014/11/04/implementing-popularity-boosting-in-search/

docValues部分更新: https://issues.apache.org/jira/browse/SOLR-5944

docValues partial update: https://issues.apache.org/jira/browse/SOLR-5944

ExternalFileField: http://www.findwise.com/blog/externalfilefield-in-solr/

ExternalFileField: http://www.findwise.com/blog/externalfilefield-in-solr/

这篇关于什么意思是“文档受欢迎度"?在Solr的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆