PHP：如何抓取基于Javascript的网站内容 [英] PHP: How to scrape content of the website based on Javascript

查看：388 发布时间：2017/3/5 22:04:00 javascript php curl web-scraping noscript

本文介绍了PHP：如何抓取基于Javascript的网站内容的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我想使用PHP simplehtmldom库来获取本网站的内容。

I'm trying to get content of this website using PHP simplehtmldom library.

http://www.immigration.govt.nz/migrant/stream/work/workingholiday/czechwhs.htm

它不工作，所以我尝试使用CURL：

It is not working, so i tried using CURL:

function curl_get_file_contents($URL)
{
    $c = curl_init();
    curl_setopt($c, CURLOPT_RETURNTRANSFER, 1);
    curl_setopt($c, CURLOPT_URL, $URL);
    $contents = curl_exec($c);
    curl_close($c);

    if ($contents) return $contents;
    else return FALSE;
}

但是总是只能使用一些JS代码和内容进行respose：

But always get only respose with some JS code and content:

<noscript>Please enable JavaScript to view the page content.</noscript>

有没有可能使用PHP解决这个问题？在这种情况下我必须使用PHP，所以我需要模拟基于JS的浏览器。

Is any possibility to solve this using PHP? I must use PHP in this case so i need to simulate JS based browser.

非常感谢任何建议。

PHP：如何抓取基于Javascript的网站内容 [英] PHP: How to scrape content of the website based on Javascript

问题描述

推荐答案

相关文章

PHP最新文章

热门教程

热门工具

登录关闭

PHP：如何抓取基于Javascript的网站内容 [英] PHP: How to scrape content of the website based on Javascript

问题描述

推荐答案

相关文章

PHP最新文章

热门教程

热门工具

登录 关闭

登录关闭