试图从post后通过curl从页面抓取数据 [英] trying to grab data from a page after post via curl

查看:161
本文介绍了试图从post后通过curl从页面抓取数据的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我试图从这里抓取数据: http://mediaforest.biz/mobile/nowplaying。 aspx
在你选择一个车站的页面,并发布,那么你得到新的数据页面。但我不能抓住它,我再次得到相同的页面。

i am trying to grab data from here : http://mediaforest.biz/mobile/nowplaying.aspx in the page you select a station and post it then you get new page with data. but i cant grab it, i get the same page again.

我使用这个代码:

<?php
header ('Content-type: text/html; charset=utf-8');

$url = "http://mediaforest.biz/mobile/nowplaying.aspx";
$referer = "";

// headers
$header[] = "Host: ".parse_url($url, PHP_URL_HOST);
$header[] = "User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.1; he; rv:1.9.2.3) Gecko/20100401 Firefox/3.6.3";
$header[] = "Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8";
$header[] = "Accept-Language: he,en-us;q=0.7,en;q=0.3";
$header[] = "Accept-Encoding: gzip,deflate";
$header[] = "Accept-Charset: windows-1255,utf-8;q=0.7,*;q=0.7";
$header[] = "Keep-Alive: 115";
$header[] = "Connection: keep-alive";

$cookie="cookie.txt";
$fp=fopen($cookie,"w+");

$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch,CURLOPT_REFERER,$referer);
curl_setopt($ch, CURLOPT_TIMEOUT, 900);
curl_setopt($ch, CURLOPT_FAILONERROR, false);
curl_setopt($ch, CURLOPT_SSL_VERIFYHOST, 0);
curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, 0);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, false);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_HTTPHEADER, $header);
curl_setopt($ch, CURLOPT_HEADER,true);
curl_setopt($ch, CURLOPT_COOKIEFILE,$cookie);
curl_setopt($ch, CURLOPT_COOKIEJAR,$cookie);
curl_setopt($ch, CURLOPT_VERBOSE, 0);

$content=curl_exec($ch);
echo $content;

    if(stristr($content,"__EVENTTARGET")){
        $array1=explode('__EVENTTARGET" value="',$content);
        $content1=$array1[1];
        $array2=explode('">
<input type="hidden" name="__EVENTARGUMENT"',$content1);
        $content2=$array2[0];
        $EVENTTARGET=urlencode($content2);
    }

    if(stristr($content,"__EVENTARGUMENT")){
        $array1=explode('__EVENTARGUMENT" value="',$content);
        $content1=$array1[1];
        $array2=explode('">
<script language',$content1);
        $content2=$array2[0];
        $EVENTARGUMENT=urlencode($content2);
    }

    if(stristr($content,"formNowPlaying")){
        $array1=explode('method="post" action="',$content);
        $content1=$array1[1];
        $array2=explode('">
<input type="hidden" name="__EVENTTARGET"',$content1);
        $content2=$array2[0];
        $nexturl=$content2;
    }

//echo $EVENTTARGET." ".$EVENTARGUMENT." ".$nexturl;

$url = "http://mediaforest.biz/mobile/".$nexturl;

$fields = "EVENTTARGET=".$EVENTTARGET."&__EVENTARGUMENT=".$EVENTARGUMENT."&MyChannels=0&ViewChannel_Button=Show"; 

curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_POST, true);
curl_setopt($ch, CURLOPT_POSTFIELDS, $fields);
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch,CURLOPT_REFERER,$referer);
curl_setopt($ch, CURLOPT_TIMEOUT, 900);
curl_setopt($ch, CURLOPT_FAILONERROR, false);
curl_setopt($ch, CURLOPT_SSL_VERIFYHOST, 0);
curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, 0);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, false);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_HTTPHEADER, $header);
curl_setopt($ch, CURLOPT_HEADER,true);
curl_setopt($ch, CURLOPT_COOKIEFILE,$cookie);
curl_setopt($ch, CURLOPT_COOKIEJAR,$cookie);
curl_setopt($ch, CURLOPT_VERBOSE, 1);

$content_stage2=curl_exec($ch);
echo $content_stage2;
?>


推荐答案

如果您从第一个请求获取数据,请尝试关闭(对于每个请求)。

If you're getting data from the first request, try closing afterwards (for each request).

$content=curl_exec($ch); 
curl_close($ch); 
echo $content;

$content_stage2=curl_exec($ch);
curl_close($ch);    
echo $content_stage2; 

这篇关于试图从post后通过curl从页面抓取数据的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆