Returning term count for a single document using the terms facet in Elasticsearch


Question

Say I have the following search query...

POST /topics/_search
{
    "fields": [
       "topic_attachment",
       "topic_replies",
       "topic_status"
    ],
    "query" : {
        "filtered" : {
            "query" : {
                "term" : { 
                    "_id" : "5478"
                }
            }
        }           
    },
    "facets": {
        "text": {
            "terms": {
                "field": "text",
                "size": 10,
                "order": "count"
            }
        }
    }
}

The result of this search is the following:

{
   "took": 93,
   "timed_out": false,
   "_shards": {
      "total": 5,
      "successful": 5,
      "failed": 0
   },
   "hits": {
      "total": 1,
      "max_score": 1,
      "hits": [
         {
            "_index": "topics",
            "_type": "full-topic",
            "_id": "5478",
            "_score": 1,
            "fields": {
               "topic_replies": 1141,
               "topic_status": 0,
               "topic_attachment": false
            }
         }
      ]
   },
   "facets": {
      "text": {
         "_type": "terms",
         "missing": 0,
         "total": 8058,
         "other": 8048,
         "terms": [
            {
               "term": "ω",
               "count": 1
            },
            {
               "term": "œyouâ",
               "count": 1
            },
            {
               "term": "œyou",
               "count": 1
            },
            {
               "term": "œwhisperedâ",
               "count": 1
            },
            {
               "term": "œwalt",
               "count": 1
            },
            {
               "term": "œunderstandingâ",
               "count": 1
            },
            {
               "term": "œtieâ",
               "count": 1
            },
            {
               "term": "œthe",
               "count": 1
            },
            {
               "term": "œpersonally",
               "count": 1
            },
            {
               "term": "œnappiesâ",
               "count": 1
            }
         ]
      }
   }
}

Each term has a count of exactly 1. Why is this? I know that several terms occur more than once in this document's text. Is this because the count only increments once per document? If so, how do I count a term more than once within a single document?

Recommended answer

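That's the document count, not the term frequency: each term is counted once per document that contains it, so with a single matching document every count is 1. Luckily, in the new aggregations module (the replacement for facets, introduced in 1.0.Beta2), count has been renamed to doc_count to make clear what it is.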
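
To make the distinction concrete, here is a minimal Python sketch that does not talk to Elasticsearch at all. The `tokenize` helper is a hypothetical stand-in for an analyzer (lowercase, split on whitespace), and the sample sentence is made up for illustration. It only shows why a per-document count stays at 1 while the term frequency inside the document can be higher.

```python
from collections import Counter


def tokenize(text):
    # Hypothetical analyzer: lowercase and split on whitespace.
    # Real Elasticsearch analyzers behave differently.
    return text.lower().split()


def doc_counts(docs):
    """Count, for each term, how many documents contain it."""
    counts = Counter()
    for text in docs:
        # set() ensures a term is counted at most once per document.
        counts.update(set(tokenize(text)))
    return counts


def term_frequencies(text):
    """Count how many times each term occurs inside one document."""
    return Counter(tokenize(text))


# A single matching document, as in the question (made-up text).
docs = ["the cat saw the other cat"]

print(doc_counts(docs)["the"])           # 1
print(term_frequencies(docs[0])["the"])  # 2
```

With only one document in the result set, every value returned by `doc_counts` is 1, which matches the facet output in the question; the repeated terms only show up in `term_frequencies`.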
