当查询包含多个术语时,如何在整个短语上使用 Solr 自动完成? [英] How to have Solr autocomplete on whole phrase when query contains multiple terms?
问题描述
我在这里查看了大量示例和其他问题,从他们那里,我的配置非常接近我需要的配置,但是我错过了最后一点锻炼的时间.我正在搜索以下值:
I've looked through a ton of examples and other questions here and from them, I've got my config very close to what I need but I'm missing one last little bit that I'm having a heck of a time working out. I'm searching on values like:
solar powered
solar glass
solar globe
solar lights
solar magic
solid brass
solid copper
我想要的:
- 如果我搜索
sol
,结果应该包括所有这些值.这行得通. - 如果我搜索
solar
,我应该只得到前五个.这行得通. - 如果我搜索
solar gl
,我应该只得到solar glass
和solar global
.这不起作用.相反,我得到一组solar
的匹配项和第二组gl
的匹配项.
- If I search for
sol
the result should include all these values. This works. - If I search for
solar
I should get just the first five. This works. - If I search for
solar gl
I should get onlysolar glass
andsolar globe
. This does not work. Instead, I get one set of matches forsolar
and a second set of matches forgl
.
简而言之,我想将输入字符串视为一个整体,而不考虑任何空格.我认为这是通过创建一个单独的查询(相对于索引)分析器来完成的,但我一直无法让它工作.任何人都可以建议一种配置,可以让我得到我正在寻找的东西?
In a nutshell, I want to consider the input string as a whole, regardless of any whitespace. I gather this is accomplished by creating a separate query (versus index) analyzer, but I've not been able to make it work. Can anyone suggest a configuration that will get me what I'm looking for?
我(未成功)尝试过:
- 使用
solar gl"进行查询
- 使用
mm=100%
查询 - 使用 KeywordTokenizerFactory 定义单独的查询和索引分析器.(我不知道我认为这会做什么.)
- 定义索引分析器而不是查询分析器.
- 定义一个没有分词器的查询分析器.
这是我当前的架构:
<field name="suggest_phrase" type="suggest_phrase"
indexed="true" stored="false" multiValued="false" />
和字段定义:
<fieldType name="suggest_phrase" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<tokenizer class="solr.KeywordTokenizerFactory" />
<filter class="solr.LowerCaseFilterFactory" />
</analyzer>
</fieldType>
和配置:
<searchComponent name="suggest_phrase" class="solr.SpellCheckComponent">
<lst name="spellchecker">
<str name="name">suggest_phrase</str>
<str name="classname">org.apache.solr.spelling.suggest.Suggester</str>
<str name="lookupImpl">org.apache.solr.spelling.suggest.fst.FSTLookup</str>
<str name="field">suggest_phrase</str>
<str name="buildOnCommit">true</str>
</lst>
</searchComponent>
<requestHandler class="org.apache.solr.handler.component.SearchHandler" name="/suggest_phrase">
<lst name="defaults">
<str name="spellcheck">true</str>
<str name="spellcheck.dictionary">suggest_phrase</str>
<str name="spellcheck.onlyMorePopular">true</str>
<str name="spellcheck.count">10</str>
<str name="spellcheck.collate">false</str>
</lst>
<arr name="components">
<str>suggest_phrase</str>
</arr>
</requestHandler>
推荐答案
终于找到答案了!我知道我真的很接近.结果证明我上面的配置是正确的,我只需要更改我的查询.
Found the answer, finally! I knew I was really close. Turns out my configuration above was correct and I simply needed to change my query.
- 使用
KeywordTokenizerFactory
以便将字符串作为一个整体编入索引. - 使用
SpellCheckComponent
作为请求处理程序. - 我遗漏的部分——不要用
q=
查询,而是用spellcheck.q=
.
- Use
KeywordTokenizerFactory
so that the strings get indexed as a whole. - Use
SpellCheckComponent
for the request handler. - The piece I was missing -- don't query with
q=<string>
but withspellcheck.q=<string>
.
给定上面提到的源字符串和 spellcheck.q=solar+gl
的查询,这会产生所需的结果:
Given the source strings noted above and a query of spellcheck.q=solar+gl
this yields the desired results:
solar glass
solar globe
这篇关于当查询包含多个术语时,如何在整个短语上使用 Solr 自动完成?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!