Solr 在使用存储的 html 字段突出显示时剥离 html [英] Solr Strip html when highlighting with stored html fields

查看：20 发布时间：2021/12/30 8:48:16 solr ruby-on-rails-3.1 sunspot-solr

本文介绍了Solr 在使用存储的 html 字段突出显示时剥离 html的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

在 Rails 中使用 Solr 和 Sunspot.

Using Solr and Sunspot in rails.

我正在使用如下字段类型搜索 html 字段:

I am searching on an html field using a field type like this:

<fieldType name="text_html" class="solr.TextField" omitNorms="false">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <charFilter class="solr.HTMLStripCharFilterFactory"/>
    <filter class="solr.StandardFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.ISOLatin1AccentFilterFactory"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
</fieldType>

然后我执行搜索并使用存储的字段，以便我可以在结果中返回突出显示的文本.我遇到的问题是存储的值中有原始的 html 文本.例如:搜索新闻"正在返回:

I am then performing a search and using a stored field so that I can return highlighted text in the results. The problem I am having is that the stored value has the original html text in it. For example: a search on 'news' is returning:

"与@@@hl@@@news@@@endhl@@@、体育、本地优惠和所有最新对话的社区联系.</div> </div> </div>"

"community connection to @@@hl@@@news@@@endhl@@@, sports, local deals and all the latest conversations.</div> </div> </div>"

然后我想用 html 包装的标签替换标签@@@hl@@@、@@@endhl@@@.

I then want to replace tags @@@hl@@@, @@@endhl@@@ with html wrapped tags.

我是否需要自己手动去除原始 html 标签(div 等)标签，或者有没有办法让存储的值已经去除 html 标签?

Do I need to manually strip out the original html tags (divs, etc) tags out myself or is there a way to get the stored value to already have html tags stripped out?

我知道如何手动执行此操作，只是想确保我没有遗漏 schema.xml 或 solrconfig.xml 中的某些内容.

I know how to do this manually, just wanted to make sure I wasn't missing something in the schema.xml or solrconfig.xml.

谢谢

Solr 在使用存储的 html 字段突出显示时剥离 html [英] Solr Strip html when highlighting with stored html fields

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

Solr 在使用存储的 html 字段突出显示时剥离 html [英] Solr Strip html when highlighting with stored html fields

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭