如何在Python 3中解析原始HTTP请求? [英] How to parse raw HTTP request in Python 3?
问题描述
我正在寻找一种在Python 3中解析http请求的本地方法。
I am looking for a native way to parse an http request in Python 3.
这个问题显示了在Python 2中实现它的方法,但现在使用已弃用的模块(和Python 2),我正在寻找一种在Python 3中实现它的方法。
This question shows a way to do it in Python 2, but uses now deprecated modules, (and Python 2) and I am looking for a way to do it in Python 3.
我主要想知道请求什么资源并解析标头和简单的请求。 (即):
I would mainly like to just figure out what resource is requested and parse the headers and from a simple request. (i.e):
GET /index.html HTTP/1.1
Host: localhost
Connection: keep-alive
Cache-Control: max-age=0
Upgrade-Insecure-Requests: 1
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/52.0.2743.116 Safari/537.36
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8
Accept-Encoding: gzip, deflate, sdch
Accept-Language: en-US,en;q=0.8
有人能告诉我解析这个请求的基本方法吗?
Can someone show me a basic way to parse this request?
推荐答案
你可以使用 email.message.Message $ c $来自
模块。 电子邮件$ c的c>
课程标准库中的$ c>
You could use the email.message.Message
class from the email
module in the standard library.
通过修改您链接的问题的答案,是一个解析HTTP头的Python3示例。
By modifying the answer from the question you linked, below is a Python3 example of parsing HTTP headers.
假设您要创建一个包含所有标题字段的字典:
Suppose you wanted to create a dictionary containing all of your header fields:
import email
import pprint
from io import StringIO
request_string = 'GET / HTTP/1.1\r\nHost: localhost\r\nConnection: keep-alive\r\nCache-Control: max-age=0\r\nUpgrade-Insecure-Requests: 1\r\nUser-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/52.0.2743.116 Safari/537.36\r\nAccept: text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8\r\nAccept-Encoding: gzip, deflate, sdch\r\nAccept-Language: en-US,en;q=0.8'
# pop the first line so we only process headers
_, headers = request_string.split('\r\n', 1)
# construct a message from the request string
message = email.message_from_file(StringIO(headers))
# construct a dictionary containing the headers
headers = dict(message.items())
# pretty-print the dictionary of headers
pprint.pprint(headers, width=160)
如果你在python提示符下运行它,结果将如下所示:
if you ran this at a python prompt, the result would look like:
{'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8',
'Accept-Encoding': 'gzip, deflate, sdch',
'Accept-Language': 'en-US,en;q=0.8',
'Cache-Control': 'max-age=0',
'Connection': 'keep-alive',
'Host': 'localhost',
'Upgrade-Insecure-Requests': '1',
'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/52.0.2743.116 Safari/537.36'}
这篇关于如何在Python 3中解析原始HTTP请求?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!