IFilter的或SDK的多种文件类型? [英] IFilter or SDK for many file types?
问题描述
有谁知道一个API / SDK或IFilter的.NET中,可以读取以下文件的主题(标题元数据)和文字:
Does anybody know of an API/SDK or IFilter in .NET that can read the subject ('title' metadata) and text from the following files:
.PDF .DOC .XLS .PPT .CSV 。文本 .DOCX .XLS .PPTX + OpenOffice的开放文档标准。
.PDF .DOC .XLS .PPT .CSV .TXT .DOCX .XLS .PPTX + the OpenOffice and Open Document standards.
开源将是真棒......但商业是OK了。
Open source would be awesome... but commercial is OK too.
我找不到任何东西任何地方!
I can't find anything anywhere!
推荐答案
我不认为你可以找到一个单一的IFilter,将能够访问所有这些类型的内容。通常情况下,的IFilter将是一种特定的技术。
I don't think you will be able to find a single IFilter that will be able to access the contents of all of those types. Typically, an IFilter will be for a specific technology.
例如,Adobe公司有一对PDF文件中,微软提供了一个用于办公,可以做的Word,Excel,PowerPoint中,CSV(我相信来自pre安装Windows)。
For example, Adobe have one for PDFs, Microsoft provide one for Office that can do Word, Excel, Powerpoint, CSV (that I believe comes pre-installed with Windows).
这篇关于IFilter的或SDK的多种文件类型?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!