How to automatically OCR photos and import their content into a SQL database
Question
I have several hundred invoices in PDF or JPEG format.
I want to OCR these files automatically and import their contents into a SQL database.
Which Azure products can I use?
Thanks,
Joe
Recommended answer
Hi UOG Admin,
You can use the following tutorial as an example solution and a starting point. It would need to be modified for SQL Database, and a client would need to be written to pull each .pdf or .jpg, call the OCR Computer Vision API, and commit the image thumbnail and metadata to an architected back end.
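The client loop described above could look roughly like the sketch below. It is not taken from the tutorial: `find_invoices` and `process_invoices` are hypothetical names, and the `ocr` argument is a placeholder for whatever function wraps the real Computer Vision OCR call, which is not shown here.

```python
from pathlib import Path
from typing import Callable, Dict, Iterator, List

# File types named in the answer; extend as needed.
INVOICE_SUFFIXES = {".pdf", ".jpg", ".jpeg"}


def find_invoices(folder: Path) -> Iterator[Path]:
    """Yield invoice files under `folder`, sorted for a stable order."""
    for path in sorted(folder.rglob("*")):
        if path.is_file() and path.suffix.lower() in INVOICE_SUFFIXES:
            yield path


def process_invoices(folder: Path, ocr: Callable[[bytes], str]) -> List[Dict[str, str]]:
    """Run `ocr` on every invoice and collect one metadata record per file.

    `ocr` is a hypothetical placeholder: in a real client it would wrap the
    call to the OCR service and return the recognized text.
    """
    records = []
    for path in find_invoices(folder):
        text = ocr(path.read_bytes())
        records.append({"file_name": path.name, "ocr_text": text})
    return records
```

Passing the OCR step in as a function keeps the file handling testable without network access, and lets you swap in the real service call later.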
It is likely more efficient to store the image in blob storage and use Cosmos DB or Azure SQL Database for the metadata, where you can run reports and so on. The metadata would include a link to the image, so the image itself is not stored in the database.
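As a rough illustration of that layout, the sketch below keeps only the file name, a link to the image, and the OCR text in a table. It uses Python's built-in `sqlite3` as a local stand-in for Azure SQL Database; the `invoices` table, its columns, and the helper names are assumptions made for this example, and the URLs you store would be whatever your blob storage returns.

```python
import sqlite3

# One row per invoice: the image lives in blob storage, the row only links to it.
SCHEMA = """
CREATE TABLE IF NOT EXISTS invoices (
    id        INTEGER PRIMARY KEY,
    file_name TEXT NOT NULL,
    blob_url  TEXT NOT NULL,
    ocr_text  TEXT NOT NULL
)
"""


def save_invoice(conn: sqlite3.Connection, file_name: str, blob_url: str, ocr_text: str) -> int:
    """Insert one metadata row and return its generated id."""
    cur = conn.execute(
        "INSERT INTO invoices (file_name, blob_url, ocr_text) VALUES (?, ?, ?)",
        (file_name, blob_url, ocr_text),
    )
    conn.commit()
    return cur.lastrowid


def search_invoices(conn: sqlite3.Connection, term: str) -> list:
    """Return (file_name, blob_url) for invoices whose OCR text contains `term`."""
    rows = conn.execute(
        "SELECT file_name, blob_url FROM invoices WHERE ocr_text LIKE ? ORDER BY id",
        (f"%{term}%",),
    )
    return rows.fetchall()
```

The same parameterized statements should carry over to Azure SQL Database with a suitable driver, although the column types and placeholder syntax may need adjusting.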
Take the following architecture as an example: Retail and e-commerce using Cosmos DB
In that architecture, you would use only what is in the red box. The browser could instead be an API endpoint, and the overall solution gives you search capabilities and efficient image storage and access. Azure SQL Database could be used in place of Cosmos DB, but this scenario is already set up for Cosmos DB.