在 Chrome 中加载页面时，如何捕获所有网络请求和完整响应数据? [英] How can I capture all network requests and full response data when loading a page in Chrome?

查看：111 发布时间：2021/12/10 15:30:45 javascript google-chrome puppeteer

本文介绍了在 Chrome 中加载页面时，如何捕获所有网络请求和完整响应数据?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

使用 Puppeteer，我想在 Chrome 中加载一个 URL 并捕获以下信息:

Using Puppeteer, I'd like to load a URL in Chrome and capture the following information:

请求网址
请求头
请求发布数据
响应标题文本(包括重复的标题，如 set-cookie)
传输的响应大小(即压缩大小)
完整的响应正文

捕获完整的响应正文是我遇到问题的原因.

Capturing the full response body is what causes the problems for me.

我尝试过的事情:

使用 response.buffer - 如果在任何时候有重定向，这都不起作用，因为缓冲区被擦除在导航
拦截请求并使用 getResponseBodyForInterception - 这意味着我可以不再访问编码长度，并且在某些情况下我也无法获取正确的请求和响应标头
使用本地代理有效，但这会显着减慢页面加载时间(并且还改变了一些行为，例如证书错误)



Getting response content with response.buffer - this does not work if there are redirects at any point, since buffers are wiped on navigation
intercepting requests and using getResponseBodyForInterception - this means I can no longer access the encodedLength, and I also had problems getting the correct request and response headers in some cases
Using a local proxy works, but this slowed down page load times significantly (and also changed some behavior for e.g. certificate errors)

理想情况下，该解决方案应该只对性能产生很小的影响，并且与正常加载页面没有功能差异.我也想避免分叉 Chrome.
Ideally the solution should only have a minor performance impact and have no functional differences from loading a page normally. I would also like to avoid forking Chrome.
推荐答案
您可以使用 page.setRequestInterception() 对于每个请求，然后在 page.on('request')，你可以使用request-promise-native 模块充当中间人收集响应数据之前使用 request.continue() 在 Puppeteer 中.
You can enable a request interception with page.setRequestInterception() for each request, and then, inside page.on('request'), you can use the request-promise-native module to act as a middle man to gather the response data before continuing the request with request.continue() in Puppeteer.
这是一个完整的工作示例:
'use strict';

const puppeteer = require('puppeteer');
const request_client = require('request-promise-native');

(async () => {
  const browser = await puppeteer.launch();
  const page = await browser.newPage();
  const result = [];

  await page.setRequestInterception(true);

  page.on('request', request => {
    request_client({
      uri: request.url(),
      resolveWithFullResponse: true,
    }).then(response => {
      const request_url = request.url();
      const request_headers = request.headers();
      const request_post_data = request.postData();
      const response_headers = response.headers;
      const response_size = response_headers['content-length'];
      const response_body = response.body;

      result.push({
        request_url,
        request_headers,
        request_post_data,
        response_headers,
        response_size,
        response_body,
      });

      console.log(result);
      request.continue();
    }).catch(error => {
      console.error(error);
      request.abort();
    });
  });

  await page.goto('https://example.com/', {
    waitUntil: 'networkidle0',
  });

  await browser.close();
})();


                        这篇关于在 Chrome 中加载页面时，如何捕获所有网络请求和完整响应数据?的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！


                    
                        查看全文

在 Chrome 中加载页面时，如何捕获所有网络请求和完整响应数据? [英] How can I capture all network requests and full response data when loading a page in Chrome?

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录关闭

在 Chrome 中加载页面时，如何捕获所有网络请求和完整响应数据? [英] How can I capture all network requests and full response data when loading a page in Chrome?

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录 关闭

登录关闭