我如何估算Lucene索引的大小? [英] How do I estimate the size of a Lucene index?

查看:93
本文介绍了我如何估算Lucene索引的大小?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

是否存在可以用来估算新Lucene索引大小的已知数学公式?我知道我要索引多少个字段,以及每个字段的大小.而且,我知道将为多少项目建立索引.那么,一旦这些由Lucene处理,它将如何转换为字节?

Is there a known math formula that I can use to estimate the size of a new Lucene index? I know how many fields I want to have indexed, and the size of each field. And, I know how many items will be indexed. So, once these are processed by Lucene, how does it translate into bytes?

推荐答案

这是 lucene索引格式文档. 主文件是复合索引(.cfs文件). 如果您有字词统计信息,则可能可以估算出.cfs文件大小, 请注意,这根据您使用的分析器和定义的字段类型而有很大差异.

Here is the lucene index format documentation. The major file is the compound index (.cfs file). If you have term statistics, you can probably get an estimate for the .cfs file size, Note that this varies greatly based on the Analyzer you use, and on the field types you define.

这篇关于我如何估算Lucene索引的大小?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆