如何从Selenium Webdriver Firefox中的`driver.page_source`获取HTTP请求的原始JSON响应 [英] How to get the raw JSON response of a HTTP request from `driver.page_source` in Selenium webdriver Firefox

查看:145
本文介绍了如何从Selenium Webdriver Firefox中的`driver.page_source`获取HTTP请求的原始JSON响应的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

如果我浏览到 https://httpbin.org/headers ,我希望得到以下JSON响应:

If I browse to https://httpbin.org/headers I expect to get the following JSON response:

{
  "headers": {
    "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8", 
    "Accept-Encoding": "gzip, deflate, br", 
    "Accept-Language": "en-US,en;q=0.5", 
    "Connection": "close", 
    "Host": "httpbin.org", 
    "Upgrade-Insecure-Requests": "1", 
    "User-Agent": "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:64.0) Gecko/20100101 Firefox/64.0"
  }
}

但是,如果我使用硒

from selenium import webdriver
from selenium.webdriver.firefox.options import Options

options = Options()
options.headless = True
driver = webdriver.Firefox(options=options)

url = 'https://httpbin.org/headers'
driver.get(url)
print(driver.page_source)
driver.close()

我知道

<html platform="linux" class="theme-light" dir="ltr"><head><meta http-equiv="Content-Security-Policy" content="default-src 'none' ; script-src resource:; "><link rel="stylesheet" type="text/css" href="resource://devtools-client-jsonview/css/main.css"><script type="text/javascript" charset="utf-8" async="" data-requirecontext="_" data-requiremodule="viewer-config" src="resource://devtools-client-jsonview/viewer-config.js"></script><script type="text/javascript" charset="utf-8" async="" data-requirecontext="_" data-requiremodule="json-viewer" src="resource://devtools-client-jsonview/json-viewer.js"></script></head><body><div id="content"><div id="json">{
  "headers": {
    "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8", 
    "Accept-Encoding": "gzip, deflate, br", 
    "Accept-Language": "en-US,en;q=0.5", 
    "Connection": "close", 
    "Host": "httpbin.org", 
    "Upgrade-Insecure-Requests": "1", 
    "User-Agent": "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:64.0) Gecko/20100101 Firefox/64.0"
  }
}
</div></div><script src="resource://devtools-client-jsonview/lib/require.js" data-main="resource://devtools-client-jsonview/viewer-config.js"></script></body></html>

HTML标记来自哪里?如何从driver.page_source获取HTTP请求的原始JSON响应?

Where do the HTML tags come from? How do I get the raw JSON response of a HTTP request from driver.page_source?

推荐答案

除了原始JSON响应外,driver.page_source还包含HTML,以漂亮地打印"浏览器中的响应.如果使用Firefox DOM和Style Inspector在浏览器中查看JSON响应的源,您将得到相同的结果.

In addition to the raw JSON response, driver.page_source also contains the HTML to "pretty print" the response in the browser. You'll get the same result, if you use the Firefox DOM and Style Inspector to view the source of the JSON response in the browser.

要获取原始JSON响应,您可以照常浏览HTML元素:

To get the raw JSON response you can navigate the HTML elements as usual:

print(driver.find_element_by_xpath("//div[@id='json']").text)

这篇关于如何从Selenium Webdriver Firefox中的`driver.page_source`获取HTTP请求的原始JSON响应的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆