Convert pdf, doc, ppt to html5


Problem description


I've googled (without any luck) for open source software that can convert doc, ppt, and pdf to HTML5 (exactly what Scribd does). Are there open source equivalents to the type of conversion Scribd does?

If anyone knows of a paid service, that would also work. Scribd has an API, but that's for use with the Flash viewer. Also, I would like to host my own content, as I need further control over the converted HTML document.

Solution

You're unlikely to find a single offering that does all this, especially in the open source world. It's more likely that you'll end up relying on a mishmash of tools, and you may even need to chain some converters in order to get to HTML (e.g. PDF -> PS -> HTML).
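To make the chaining idea concrete, here is a minimal Python sketch that picks the shortest chain of conversions from a table of single-step converters. The table itself (`AVAILABLE_STEPS`) and the function name `find_chain` are hypothetical and only for illustration; they are not a survey of real tools.

```python
from collections import deque

# Hypothetical table: source format -> formats some installed tool can produce.
# These entries are illustrative assumptions, not a list of real converters.
AVAILABLE_STEPS = {
    "pdf": ["ps"],
    "ps": ["html"],
    "doc": ["odt", "html"],
    "ppt": ["odp"],
    "odp": ["html"],
}

def find_chain(source, target, steps=AVAILABLE_STEPS):
    """Return the shortest list of formats from source to target, or None."""
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        # Breadth-first search: try every format reachable in one more step.
        for next_format in steps.get(path[-1], []):
            if next_format not in seen:
                seen.add(next_format)
                queue.append(path + [next_format])
    return None

print(find_chain("pdf", "html"))
```

Each hop in the returned list would then map to one external converter call, so a longer chain also means more chances to lose formatting along the way.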

OpenOffice supports conversion to HTML, and can be called from the command line.
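As a rough sketch of the command-line route, the snippet below wraps a headless conversion in Python. It assumes a `soffice` binary on the PATH that accepts the `--headless --convert-to html --outdir` options as documented for LibreOffice; whether a given OpenOffice install accepts the same flags is an assumption you would need to check. The function names are made up for this example.

```python
import subprocess
from pathlib import Path

def soffice_html_command(source, out_dir, binary="soffice"):
    """Build the argument list for a headless office-to-HTML conversion."""
    # Flags follow the LibreOffice command line; other builds may differ.
    return [binary, "--headless", "--convert-to", "html",
            "--outdir", str(out_dir), str(source)]

def convert_to_html(source, out_dir, binary="soffice"):
    """Run the conversion and return the path where the HTML is expected."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(soffice_html_command(source, out_dir, binary), check=True)
    return Path(out_dir) / (Path(source).stem + ".html")
```

Calling `convert_to_html("slides.ppt", "out")` should leave `out/slides.html` behind if the binary is installed and the input format is supported.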

http://pdftohtml.sourceforge.net/ looks reasonably good at converting pdf to html.
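If you want to script it, something like the following could work. This is a sketch that assumes a `pdftohtml` binary on the PATH that takes the PDF path followed by an output base name and understands `-noframes` (a single HTML file instead of a frameset); check `pdftohtml -h` on your build before relying on it. The helper names are hypothetical.

```python
import subprocess

def pdftohtml_command(pdf_path, output_base, binary="pdftohtml", no_frames=True):
    """Build the argument list for a pdftohtml run."""
    command = [binary]
    if no_frames:
        # Assumed flag: ask for one HTML file rather than a frameset.
        command.append("-noframes")
    command += [str(pdf_path), str(output_base)]
    return command

def run_pdftohtml(pdf_path, output_base):
    """Run pdftohtml and raise CalledProcessError if it exits non-zero."""
    subprocess.run(pdftohtml_command(pdf_path, output_base), check=True)
```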

For Word documents in WordML or OpenXML format, it's conceivable that you could use XSLT transforms, since both the input and output formats are XML. I've seen some stylesheets floating around the net that do this, but YMMV.
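To show why the XML angle is plausible, here is a small Python sketch. Note that it is not XSLT: it swaps in a direct walk over the XML tree using only the standard library, and it assumes an OpenXML `.docx` file (a zip archive with the body in `word/document.xml`). It keeps paragraph text only and drops all styling, tables, and images; the function name is made up for this example.

```python
import zipfile
import xml.etree.ElementTree as ET
from html import escape

# WordprocessingML namespace used by elements in word/document.xml.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"

def docx_to_basic_html(docx_file):
    """Turn the paragraphs of a .docx into a minimal HTML5 page (text only)."""
    with zipfile.ZipFile(docx_file) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph's text is spread over one or more w:t run elements.
        text = "".join(node.text or "" for node in para.iter(W_NS + "t"))
        if text:
            paragraphs.append("<p>" + escape(text) + "</p>")
    body = "\n".join(paragraphs)
    return "<!DOCTYPE html>\n<html><body>\n" + body + "\n</body></html>"
```

A real stylesheet (or a fuller script) would also have to map styles, lists, and tables, which is where most of the effort goes.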

Incidentally, why is there a specific requirement for open source? MS PowerPoint already supports save-as-HTML, for example.
