Downloading Image Data URIs from Webpages via BeautifulSoup
Question
I need to retrieve an image from a website using Python. However, the image is not a linked file; it is embedded in the page as a GIF data URI. How do I download it and store it in a .gif file?
Recommended answer
This should get you going in the right direction.
First, I'll assume you have already retrieved the image's data URI and saved it in a Python variable called img_data:
# Example
img_data = 'data:image/jpeg;base64,/9j/4A...<lots of data>...k='
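If the data URI still has to be pulled out of the page, one possible approach is to collect the src attribute of every img tag and keep those that start with "data:". The sketch below uses the standard library's html.parser rather than BeautifulSoup so that it has no third-party dependency; with BeautifulSoup the same prefix check could be applied to the tags returned by soup.find_all('img'). The class name DataURICollector and the sample html string are hypothetical and stand in for the real downloaded page.

```python
from html.parser import HTMLParser


class DataURICollector(HTMLParser):
    """Collect the src of every <img> tag whose src is a data URI."""

    def __init__(self):
        super().__init__()
        self.data_uris = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        # attrs is a list of (name, value) pairs; value may be None
        src = dict(attrs).get("src") or ""
        if src.startswith("data:"):
            self.data_uris.append(src)


# Hypothetical page content; in practice this would be the downloaded HTML.
html = """
<html><body>
  <img src="/static/logo.png">
  <img src="data:image/gif;base64,R0lGODlhAQABAIAAAAAAAP///ywAAAAAAQABAAACAUwAOw==">
</body></html>
"""

collector = DataURICollector()
collector.feed(html)

# Take the first embedded image; linked images such as logo.png are skipped.
img_data = collector.data_uris[0]
```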
Now you'll need to decode the picture from base64 and save it to a file:
import base64
# Separate the metadata from the image data
head, data = img_data.split(',', 1)
# Get the file extension (gif, jpeg, png)
file_ext = head.split(';')[0].split('/')[1]
# Decode the image data
plain_data = base64.b64decode(data)
# Write the image to a file
with open('image.' + file_ext, 'wb') as f:
    f.write(plain_data)
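The code above assumes the payload is base64-encoded, which is the usual case for images. Data URIs may also omit the ";base64" marker and carry percent-encoded data instead, so a slightly more defensive variant might look like the following sketch. The helper name decode_data_uri is hypothetical, and the tiny GIF payload is only sample data.

```python
import base64
from urllib.parse import unquote_to_bytes


def decode_data_uri(uri):
    """Return (mime_type, raw_bytes) for a data URI."""
    if not uri.startswith("data:"):
        raise ValueError("not a data URI")
    # Everything before the first comma is metadata, the rest is the payload.
    head, sep, payload = uri[len("data:"):].partition(",")
    if not sep:
        raise ValueError("data URI has no comma separator")
    params = head.split(";")
    # An omitted media type defaults to text/plain.
    mime_type = params[0] or "text/plain"
    # Undo any percent-encoding first, then base64-decode if marked as such.
    raw = unquote_to_bytes(payload)
    if "base64" in params[1:]:
        raw = base64.b64decode(raw)
    return mime_type, raw


mime_type, raw = decode_data_uri(
    "data:image/gif;base64,R0lGODlhAQABAIAAAAAAAP///ywAAAAAAQABAAACAUwAOw=="
)
file_ext = mime_type.split("/")[1]
```

The returned bytes can then be written with open('image.' + file_ext, 'wb') exactly as in the answer's code.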