查看单个文档中单个字段的原始Solr令牌 [英] View raw Solr tokens for a single field on a single document

查看:88
本文介绍了查看单个文档中单个字段的原始Solr令牌的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在调试我的Solr模式,我想看看标记特定字段的结果.

I'm debugging my Solr schema and I'd like to see the results of tokenizing a specific field.

举个简单的例子,如果我有:

For a simplified example, if I have:

<fieldType name="text" class="solr.TextField" omitNorms="false">
    <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StandardFilterFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.PorterStemFilterFactory"/>
        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="15" side="front"/>
    </analyzer>
</fieldType>

并且我索引了一个值为"Hello, worlds!"的字段,我想看到以下内容:

and I indexed a field with the value "Hello, worlds!", I want to see something along the lines of:

hello world he el ll lo hel ell llo hell ello hello wo or rl ld wor orl rld worl orld

以确保一切都按照我的设想被标记化.

to ensure that everything is being tokenized as I envisage it is.

这有可能吗?

推荐答案

是的,管理员>分析正是您想要的.

Yes, Admin > Analysis is exactly what you want.

但是还有另一个很棒的工具,可让您读取索引并查看对字段或文档的索引编制情况.

But there's another great tool that allows you to read index and see how exactly a field or document was indexed.

它被称为 Luke ,在对模式进行故障排除和调整时,它是无价之宝.

It's called Luke and it's invaluable when troubleshooting and tweaking your schema.

这篇关于查看单个文档中单个字段的原始Solr令牌的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆