使用.net从图像文件中提取文本 [英] Extract Text from Image file using .net
问题描述
大家好,
我们正在使用Microsoft OCR组件的BPO项目中进行工作.该应用程序的主要功能是读取 图像文件(教科书页面的扫描图像)包含文本.在读取该图像文件之后,该应用程序必须创建一个excel表,在该表中应显示行号-行中的单词数-行中的字符数".
We are working on a BPO project in which we are using Microsoft OCR components. The main functionality of the application is to read a image file(scanned image of a page of a text book) contains text. After reading this image file, the application has to create an excel sheet in which it should show "Line Number--Number of words in LIne--Number of characters in line".
例如:
LineNumber ---单词数----字符数
LineNumber --- Number of Words ---- Number of Characters
1 b 30 nbsp; b&nspsp; nsp 100
1 30 100
2 nbsp; b 40 nbsp; bsp 200
2 40 200
以上数据应显示在excel文件中.
This above data should be shown in a excel file.
您能帮我们吗?
谢谢.
推荐答案
我希望创建一个CSV Visual Studio中的文件,可以由excel读取.与使用Excel Interop库相比,该应用程序的运行速度要快得多. CSV只是一个文本文件,其中excel列数据用逗号分隔.
I wold create a CSV file in Visual Studio which can be read by excel. The application would run much quicker than to use the Excel Interop Library. CSV is just a text file with the excel column data seperated by commas.
请参阅tnis网页,了解使用OCR密码子的C#项目
See tnis webpage for a C# project that uses the OCR Cpmponent
http://www.codeproject.com/Articles /41709/How-To-Use-Office-2007-OCR-Using-C
这篇关于使用.net从图像文件中提取文本的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!