Reading PDF content into a SQL table
Problem description
I have to read certain data from PDF documents and store it in a SQL Server 2008 table.
I used iTextSharp to read the complete PDFs and show the text in a rich text box. However, if the text is stored as an image, the content is not read.
Now what I need to do is:
1. Suppose my PDF contains the data Reference ID : 123-123.
I need to save the value 123-123 to a column in a SQL table.
2. How can I read the content if the quality of the document is poor?
3. Is there any method to read text stored as an image?
4. In some cases I found that $100 is read as 8100. Is there any way to ensure the accuracy of the text that is read?
Please help me.
Recommended answer
(3) Yes: you need to run OCR on the image to convert it to text. Finding an accurate freeware OCR API will be an issue.
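For point (1), once the text has been extracted (by iTextSharp or by an OCR step), the value can be pulled out with a regular expression and written with a parameterized insert. The following is only a minimal sketch in Python, not a definitive implementation: `sqlite3` stands in for SQL Server 2008, and the table name `pdf_references`, the column name `reference_id`, and both function names are hypothetical, chosen for illustration.

```python
import re
import sqlite3

# Matches text such as "Reference ID : 123-123" and captures "123-123".
# The digits-hyphen-digits shape is an assumption based on the single
# example given in the question.
REFERENCE_ID_PATTERN = re.compile(r"Reference\s*ID\s*:\s*(\d+-\d+)", re.IGNORECASE)


def extract_reference_id(text):
    """Return the first reference ID found in the extracted text, or None."""
    match = REFERENCE_ID_PATTERN.search(text)
    return match.group(1) if match else None


def save_reference_id(conn, reference_id):
    """Store one reference ID using a parameterized query."""
    # Hypothetical table and column names; sqlite3 is a stand-in database.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pdf_references (reference_id TEXT NOT NULL)"
    )
    conn.execute(
        "INSERT INTO pdf_references (reference_id) VALUES (?)", (reference_id,)
    )
    conn.commit()


if __name__ == "__main__":
    # Stand-in for the text that the PDF reader would return.
    sample_text = "Invoice summary\nReference ID : 123-123\nAmount: $100"
    conn = sqlite3.connect(":memory:")
    reference_id = extract_reference_id(sample_text)
    if reference_id is not None:
        save_reference_id(conn, reference_id)
    print(conn.execute("SELECT reference_id FROM pdf_references").fetchall())
```

A strict pattern like this may also help with point (4): text that does not match the expected shape is returned as None and can be flagged for manual review instead of being stored.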