通过 BeautifulSoup 从网页下载图像数据 URI [英] Downloading Image Data URIs from Webpages via BeautifulSoup

查看:24
本文介绍了通过 BeautifulSoup 从网页下载图像数据 URI的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我需要使用 Python 从网站检索图像.但是,图像不是链接文件的形式,而是 GIF 数据 URI.如何下载它并将其存储在 .gif 文件中?

I need to retrieve an image from a website using Python. However, the image is not in the form of a linked file, but as a GIF Data URI. How do I download this and store it in a .gif file?

推荐答案

这应该会让你朝着正确的方向前进.

This should get you going in the correct direction.

首先,我假设您已检索图像 uri 数据并将其保存在名为 img_data 的 python 变量中:

First, I'll assume you have retrieved the image uri data and it is saved in a python variable called img_data:

# Example
img_data = '...<lots of data>...k='

现在您需要从 base64 解码图片并将其保存到文件中:

Now you'll need to decode the picture from base64 and save it to a file:

import base64

# Separate the metadata from the image data
head, data = img_data.split(',', 1)

# Get the file extension (gif, jpeg, png)
file_ext = head.split(';')[0].split('/')[1]

# Decode the image data
plain_data = base64.b64decode(data)

# Write the image to a file
with open('image.' + file_ext, 'wb') as f:
    f.write(plain_data)

这篇关于通过 BeautifulSoup 从网页下载图像数据 URI的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆