Node.js 比使用 Tesseract.Js 的浏览器 (Safari) 慢 20 倍 [英] Node.js 20x slower than browser (Safari) with Tesseract.Js

查看:43
本文介绍了Node.js 比使用 Tesseract.Js 的浏览器 (Safari) 慢 20 倍的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

JS 新手和 Node.js 新手.在 Safari 中运行 Tesseract.js(文本识别软件:http://tesseract.projectnaptha.com)大约需要 10秒并立即开始输出进度.
Node (v6.9.1)(从终端运行或通过 Electron 运行)在开始输出到控制台之前将 CPU 运行到 100% 持续 4 分 20 秒.然后它在大约同一时间完成.

New to JS and very new to Node. Running Tesseract.js (text recognition software: http://tesseract.projectnaptha.com) in Safari takes about 10 sec and begins outputting progress immediately.
Node (v6.9.1)(run from terminal or through Electron) runs CPU to 100% for 4min 20sec before it begins outputting to console. It then finishes in about the same time.

建议采取哪些故障排除步骤?这对 Node 来说很常见吗?
我在日志中看到的唯一区别是 Safari在缓存 eng.traineddata 中找到" 清除和禁用缓存对时间的影响很小.尝试了一些 .JPG 和 .PNG (300-600kb) 文件,结果相同 - 但 BMP (3.7MB) 给出了 17 秒的快速响应 - 然后出现错误并且没有完成.(这是下一个滴答"问题吗?)

What troubleshooting steps are recommended? Is this common for Node?
Only difference I see in logs is Safari "found in cache eng.traineddata" Clearing and disabling the cache only minimally affect the time. Have tried a few .JPG and .PNG (300-600kb) files with same result - but BMP (3.7MB) gave fast 17 sec response - then errors and didn't finish. (Is this a 'next tick' problem?)

var Tesseract = require('tesseract.js');  
var image = "./images/sample.jpg";

function tesseract(){
Tesseract.recognize(image)
.progress(function(message){console.log(message)})
.then(result => console.log(result.text))
} 
tesseract();

(编辑器强制将输出格式化为代码)
节点控制台.日志

(the editor is forcing the output to be formatted as code)
NODE console.Log

>Bash-3.2$ node JustTess.js  
   *Waits 4+ min and Then*   
{ status: 'loading tesseract core' }  
{ status: 'loaded tesseract core' }  
{ status: 'initializing tesseract', progress: 0 }  
pre-main prep time:108 ms  
{ status: 'initializing tesseract', progress: 1 }  
{ status: 'loading eng.traineddata', progress: 0 }  
{ status: 'loading eng.traineddata', progress: 1 }  
{ status: 'initializing api', progress: 0 }   
{ status: 'initializing api', progress: 0.3 }   
{ status: 'initializing api', progress: 0.6 }   
{ status: 'initializing api', progress: 1 }   
{ status: 'recognizing text', progress: 0 }   
{ status: 'recognizing text', progress: 0.014285714 }...                

SAFARI 控制台.log

SAFARI console.log

>[Log]  – {status: "loading tesseract core"}  
[Log]  – {status: "loaded tesseract core"}   
[Log]  – {status: "initializing tesseract api"}  
[Log] pre-main prep time: 115 ms (index.js, line 10)  
[Log]  – {status: "initialized tesseract api"}   
[Log]  – {status: "found in cache eng.traineddata"}   
[Log]  – {status: "loaded eng.traineddata"}   
[Log]  – {status: "initialized with language"}   
[Log]  – {status: "recognizing text", progress: 0}   
[Log]  – {status: "recognizing text", progress: 0.0142}...    

带有 BMP 的节点

bash-3.2$ node JustTess.js
*After 17 sec*
    { status: 'initializing tesseract', progress: 0 }
pre-main prep time: 118 ms
{ status: 'initializing tesseract', progress: 1 }
{ status: 'loading eng.traineddata', progress: 0 }
{ status: 'loading eng.traineddata', progress: 1 }
{ status: 'initializing api', progress: 0 }
{ status: 'initializing api', progress: 0.3 }
{ status: 'initializing api', progress: 0.6 }
Error in pixRemoveColormap: pixs must be {1,2,4,8} bpp
Error in pixGetDepth: pix not defined
Error in pixGetWpl: pix not defined
Error in pixCreateHeader: depth must be {1, 2, 4, 8, 16, 24, 32}
Error in pixCreateNoInit: pixd not made
Error in pixCreateTemplateNoInit: pixd not made
Error in pixCreateTemplate: pixd not made
Error in pixCopy: pixd not made
{ status: 'initializing api', progress: 1 }
3
3

/Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programming/GitHub/ba/node_modules/tesser
act.js-core/index.js:4
function f(a){throw a;}var h=void 0,i=!0,j=null,k=!1;function aa(){return function(){}}function ba(
a){return function(){return a}}var n,Module;Module||(Module=eval("(function() { try { return Tesser
actCore || {} } catch(e) { return {} } })()"));var ca={},da;for(da in Module)Module.hasOwnProperty(
da)&&(ca[da]=Module[da]);var ea=i,fa=!ea&&i;
              ^
abort(3) at Error
    at Error (native)
    at Na (/Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programming/GitHub/ba/node_mod
ules/tesseract.js-core/index.js:32:26)
    at ka (/Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programming/GitHub/ba/node_mod
ules/tesseract.js-core/index.js:507:108)
    at Array.JHa (/Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programming/GitHub/ba/n
ode_modules/tesseract.js-core/index.js:402:25808)
    at xd (/Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programming/GitHub/ba/node_mod
ules/tesseract.js-core/index.js:382:924)
    at R.TesseractCore.V.Begin (/Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programmi
ng/GitHub/ba/node_modules/tesseract.js-core/index.js:511:288)
    at DumpLiterallyEverything (/Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programmi
ng/GitHub/ba/node_modules/tesseract.js/src/common/dump.js:13:8)
    at /Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programming/GitHub/ba/node_modules
/tesseract.js/src/common/worker.js:121:22
    at /Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programming/GitHub/ba/node_modules
/tesseract.js/src/common/worker.js:92:9
    at /Users/brent/Library/Mobile Documents/com~apple~CloudDocs/Programming/GitHub/ba/node_modules
/tesseract.js/src/node/lang.js:14:25
If this abort() is unexpected, build with -s ASSERTIONS=1 which can give more information.

推荐答案

此问题已通过更新 Tesseract.Js 软件得到解决.

This issue has been solved by an update to the Tesseract.Js software.

这篇关于Node.js 比使用 Tesseract.Js 的浏览器 (Safari) 慢 20 倍的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆