使用python-tesseract获取已识别单词的边界框 [英] Getting the bounding box of the recognized words using python-tesseract

查看：1109 发布时间：2018/7/30 15:51:52 python image-processing ocr tesseract python-tesseract

本文介绍了使用python-tesseract获取已识别单词的边界框的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用python-tesseract从图像中提取单词。这是一个用于tesseract的python包装器，它是一个OCR代码。

I am using python-tesseract to extract words from an image. This is a python wrapper for tesseract which is an OCR code.

我使用以下代码来获取单词：

I am using the following code for getting the words:

import tesseract

api = tesseract.TessBaseAPI()
api.Init(".","eng",tesseract.OEM_DEFAULT)
api.SetVariable("tessedit_char_whitelist", "0123456789abcdefghijklmnopqrstuvwxyz")
api.SetPageSegMode(tesseract.PSM_AUTO)

mImgFile = "test.jpg"
mBuffer=open(mImgFile,"rb").read()
result = tesseract.ProcessPagesBuffer(mBuffer,len(mBuffer),api)
print "result(ProcessPagesBuffer)=",result

这只返回图像中的单词而不是它们的位置/大小/方向（换句话说，包含它们的边界框）。我想知道是否有任何方法可以获得它

This returns only the words and not their location/size/orientation (or in other words a bounding box containing them) in the image. I was wondering if there is any way to get that as well

使用python-tesseract获取已识别单词的边界框 [英] Getting the bounding box of the recognized words using python-tesseract

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

使用python-tesseract获取已识别单词的边界框 [英] Getting the bounding box of the recognized words using python-tesseract

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭