Microsoft Word text parser in C
Question
I would like to know how to parse Microsoft Word (.doc and .docx) documents and extract their text content. The code must be written in plain C and build with gcc.
Are there any libraries that already do this job?
Extension: can I use the same procedure to extract text from Microsoft PowerPoint files as well?
Recommended answer
Microsoft Word documents are an enormous beast; you definitely don't want to write this code yourself. Look into an existing free Word library such as antiword or wvWare.
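As a rough illustration of leaning on an existing tool rather than parsing the format by hand, the sketch below runs the antiword command-line program from C and captures what it prints. It assumes a POSIX system (for popen), that antiword is installed and on the PATH, and that it is invoked in its usual form, antiword FILE, which writes the document text to standard output. The function names capture_output and doc_to_text are made up for this example and are not part of antiword or wvWare. As far as I know, antiword targets the older binary .doc format, so .docx and PowerPoint files would need a different tool.

```c
#define _POSIX_C_SOURCE 200809L  /* for popen/pclose */
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: run `cmd` through the shell and copy its standard
 * output into `out` (at most out_size - 1 bytes, always NUL-terminated).
 * Returns the number of bytes stored, or -1 if the command could not be
 * started or exited with a non-zero status. */
long capture_output(const char *cmd, char *out, size_t out_size)
{
    if (cmd == NULL || out == NULL || out_size == 0)
        return -1;

    FILE *pipe = popen(cmd, "r");
    if (pipe == NULL)
        return -1;

    size_t used = 0;
    size_t n;
    while (used < out_size - 1 &&
           (n = fread(out + used, 1, out_size - 1 - used, pipe)) > 0)
        used += n;
    out[used] = '\0';

    /* Discard anything that did not fit, so the child can finish cleanly. */
    char discard[256];
    while (fread(discard, 1, sizeof discard, pipe) > 0)
        ;

    int status = pclose(pipe);
    return status == 0 ? (long)used : -1;
}

/* Hypothetical helper: extract the text of a .doc file by calling antiword.
 * Assumption: antiword is on the PATH. The path is wrapped in single quotes
 * for the shell, so paths that contain a single quote are rejected. */
long doc_to_text(const char *path, char *out, size_t out_size)
{
    char cmd[1024];

    if (path == NULL || strchr(path, '\'') != NULL)
        return -1;

    int len = snprintf(cmd, sizeof cmd, "antiword '%s'", path);
    if (len < 0 || (size_t)len >= sizeof cmd)
        return -1;

    return capture_output(cmd, out, out_size);
}
```

A caller would declare a buffer, for example char text[65536];, call doc_to_text("report.doc", text, sizeof text), and treat a return value of -1 as "antiword is missing or could not read the file". Output longer than the buffer is truncated in this sketch; a real program would grow the buffer or stream the output instead.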