Plone全文索引Excel文件 [英] Plone full text indexing Excel files

查看:237
本文介绍了Plone全文索引Excel文件的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

如何自定义Plone搜索引擎以便对Excel文件进​​行全文索引?我已经安装了pdftotext和wv for pdf,word文件全文索引。

解决方案

如果添加 Products.OpenXml 到您的实例蛋,并将其安装在Plone中,您可以至少索引现代Office格式,至少为.docx和.xlsx。对于普通旧的Excel(.xls)文件,这不起作用。



我在几周前在Plone 4.3.2 buildout配置中尝试过:

  [instance] 
eggs =
...
Products.OpenXml

[版本]
#你需要一个比默认Plone更近的lxml,一些3.x版本
lxml = 3.3.3
Products.OpenXml = 1.1.1

或者或另外使用产品.AROfficeTransforms 的。我只尝试过与Products.OpenXml结合使用,但如果您只对旧版本的Excel表单感兴趣,则自己就可以使用Products.AROfficeTransforms。在buildout配置中:

  [instance] 
eggs =
...
产品.AROfficeTransforms

[版本]
Products.AROfficeTransforms = 0.11.0

它需要在系统上安装xlhtml二进制文件。这是一个古老的二进制文件,2002年最后更改。我没有尝试自己安装。


how can I customize Plone search engine in order to actvate full text indexing of excel files? I have already installed pdftotext and wv for pdf, word files full text indexing.

解决方案

If you add Products.OpenXml to your instance eggs and install it in Plone you can index modern Office formats, at least .docx and .xlsx. For plain old Excel (.xls) files this does not work.

I tried it in a Plone 4.3.2 buildout config a few weeks ago:

[instance]
eggs =
    ...
    Products.OpenXml

[versions]
# You need a more recent lxml than default Plone, some 3.x version
lxml = 3.3.3
Products.OpenXml = 1.1.1

Alternatively or additionally, use Products.AROfficeTransforms. I have only tried it in combination with Products.OpenXml, but Products.AROfficeTransforms on its own is sufficient if you are only interested in old-style excel sheets, .xls. In a buildout config:

[instance]
eggs =
    ...
    Products.AROfficeTransforms

[versions]
Products.AROfficeTransforms = 0.11.0

It requires the xlhtml binary to be installed on your system. This is an ancient binary, last changed in 2002. I did not try to install it myself.

这篇关于Plone全文索引Excel文件的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆