Pdf.js(用于节点)未呈现pdf的全部内容 [英] Pdf.js (for node) not rendering entire contents of pdf

查看：190 发布时间：2020/5/25 5:27:28 javascript node.js pdf pdf.js

本文介绍了Pdf.js(用于节点)未呈现pdf的全部内容的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试使用 https:/来搜索pdf文本/www.npmjs.com/package/pdfjs-dist-for-node .

我的代码如下:

 gettext: function(){
     var data = '../static/example.pdf';
         return pdfjs.getDocument(data).then(function(pdf) {
     var pages = [];
     for (var i = 0; i < pdf.numPages; i++) {
                 pages.push(i);
     }
     return Promise.all(pages.map(function(pageNumber) {
                 return pdf.getPage(pageNumber + 1).then(function(page) {
         return page.getTextContent().then(function(textContent) {
                         return textContent.items.map(function(item) {
             return item.str;
                         }).join(' ');
         });
                 });
     })).then(function(pages) {
         return pages.join("\r\n")
     });
         }).then(function(pages){
     console.log(pages)
     });


 }

这似乎可行，但是它会跳过部分文本.具体来说，它会跳过我在原始pdf文档中无法用鼠标突出显示的内容.有没有办法让pdf.js提取这些数据?

This seems to work, but it skips parts of the text. Specifically, it skips whatever I can't highlight with the mouse in the original pdf doc. Is there a way to get pdf.js to pick up on this data?

Pdf.js(用于节点)未呈现pdf的全部内容 [英] Pdf.js (for node) not rendering entire contents of pdf

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录关闭

Pdf.js(用于节点)未呈现pdf的全部内容 [英] Pdf.js (for node) not rendering entire contents of pdf

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录 关闭

登录关闭