用java jsoup想下载一个表格,但是网站总是进不去,求指点

查看:95
本文介绍了用java jsoup想下载一个表格,但是网站总是进不去,求指点的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

问 题

网址是这个http://www.zwsw.gov.cn/zwsw20...
我想用jsoup下载这个网址的表格,从源代码看到还挺简单,但是jsoup每次抓到的document只有网页基本信息。希望大神知道一下!谢谢!!

输出为:
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8">
<title>珠江委水文水资源网</title>
</head>
<body onload="t3_ar_guard();">
<script>
function t3_ar_guard() {
eval(function(p,a,c,k,e,d){e=function(c){return c};if(!''.replace(/^/,String)){while(c--){d[c]=k[c]||c}k=[function(e){return d[e]}];e=function(){return'\w+'};c=1};while(c--){if(k[c]){p=p.replace(new RegExp('\b'+e(c)+'\b','g'),k[c])}}return p}('0.3="4=7/6;5=/";0.2.1=0.2.1;',8,8,'document|href|location|cookie|ant_stream_58ee42b978925|path|474259458|1504312292'.split('|'),0,{}))}
</script>

</body>
</html>

代码为:

 String html2 = "http://www.zwsw.gov.cn/zwsw2011/asp/sqxq/main_sqxq.asp";
     Document doc = Jsoup.connect(html2).get();
     System.out.println(doc);
     
     

解决方案

public class Main {
    public static void main(String[] args) throws IOException {
        Document doc = Jsoup.connect("http://www.zwsw.gov.cn/zwsw2011/asp/sqxq/main_sqxq.asp").cookie("ant_stream_58ee42b978925","1504345521/771910499").get();
        Element body = doc.body();
        System.out.println(body.html());
    }
}

添加一下cookie值就可以拿到了。
输出如下:

<table width="95%" border="0" align="center" cellpadding="0" cellspacing="0"> 
 <tbody>
  <tr> 
   <td>
    <table width="100%"> 
     <tbody>
      <tr height="20"> 
       <td width="70%" align="left"><font color="#FF0000" face="华文新魏" size="5">珠江委各咸情站最近两小时咸情详情信息</font></td> 
       <td width="20%"></td> 
      </tr> 
     </tbody>
    </table> 
    <table width="100%" border="1" cellpadding="0" cellspacing="0" style="border: 3 double;border-collapse: collapse" bordercolor="#0066FF"> 
     <tbody>
      <tr height="30"> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold" colspan="2">统计时间</td> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold" colspan="1">竹银咸情站</td> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold" colspan="1">联石湾咸情站</td> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold" colspan="1">平岗咸情站站</td> 
      </tr> 
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td colspan="2">昨日(9月1日0时至23时)</td>
       <td colspan="1">均未超标</td>
       <td colspan="1">均未超标</td>
       <td colspan="1">均未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td colspan="2">今日(9月2日0时至10时)</td>
       <td colspan="1">均未超标</td>
       <td colspan="1">均未超标</td>
       <td colspan="1">均未超标</td>
      </tr> 
      <tr height="30"> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold">咸&nbsp;&nbsp;情&nbsp;&nbsp;站</td> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold">日&nbsp;&nbsp;&nbsp;期</td> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold">珠基水位(m)</td> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold">含氯度(mg/l)</td> 
       <td align="center" background="/images/bg_head.gif" style="color: #FFFFFF; font-weight: bold">是否超标</td> 
      </tr> 
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>竹银咸情站</td>
       <td>2017-9-2 9:00:00</td>
       <td>2.93</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>竹银咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>3.01</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>平岗咸情站</td>
       <td>2017-9-2 8:00:00</td>
       <td>―</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>平岗咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>―</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>马骝州咸情站</td>
       <td>2017-9-2 9:00:00</td>
       <td>.64</td>
       <td>77</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>马骝州咸情站</td>
       <td>2017-9-2 8:00:00</td>
       <td>.77</td>
       <td>77</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>马骝州咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>.85</td>
       <td>77</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>大横琴咸情站</td>
       <td>2017-9-2 9:00:00</td>
       <td>2.63</td>
       <td>77</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>大横琴咸情站</td>
       <td>2017-9-2 8:00:00</td>
       <td>2.78</td>
       <td>77</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>大横琴咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>2.84</td>
       <td>77</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>沙洛围咸情站</td>
       <td>2017-9-2 9:00:00</td>
       <td>―</td>
       <td>61</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>沙洛围咸情站</td>
       <td>2017-9-2 8:00:00</td>
       <td>―</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>沙洛围咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>―</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>冯马庙咸情站</td>
       <td>2017-9-2 9:00:00</td>
       <td>1.69</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>冯马庙咸情站</td>
       <td>2017-9-2 8:00:00</td>
       <td>1.75</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>冯马庙咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>1.74</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>大虎咸情站</td>
       <td>2017-9-2 9:00:00</td>
       <td>1.69</td>
       <td>504</td>
       <td><font color="#FF0000">超标</font></td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>大虎咸情站</td>
       <td>2017-9-2 8:00:00</td>
       <td>1.73</td>
       <td>338</td>
       <td><font color="#FF0000">超标</font></td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>大虎咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>1.69</td>
       <td>232</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>大虎咸情站</td>
       <td>2017-9-2 6:00:00</td>
       <td>1.53</td>
       <td>249</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>内伶仃咸情站</td>
       <td>2017-9-2 9:00:00</td>
       <td>―</td>
       <td>1882</td>
       <td><font color="#FF0000">超标</font></td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>内伶仃咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>―</td>
       <td>1755</td>
       <td><font color="#FF0000">超标</font></td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>大九洲咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>―</td>
       <td>4882</td>
       <td><font color="#FF0000">超标</font></td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>下横咸情站</td>
       <td>2017-9-2 9:00:00</td>
       <td>―</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#FFFFFF" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#FFFFFF'">
       <td>下横咸情站</td>
       <td>2017-9-2 8:00:00</td>
       <td>―</td>
       <td>66</td>
       <td>未超标</td>
      </tr>
      <tr height="25" align="center" bgcolor="#E3FFE3" onmouseover="this.bgColor='#d7dcf2'" onmouseout="this.bgColor='#E3FFE3'">
       <td>下横咸情站</td>
       <td>2017-9-2 7:00:00</td>
       <td>―</td>
       <td>66</td>
       <td>未超标</td>
      </tr> 
     </tbody>
    </table> 
    <table border="0" cellpadding="0" cellspacing="0" width="100%" style="color: #0000FF"> 
     <tbody>
      <tr> 
       <td align="left"><font color="#FF0000">最近更新:2017-9-2 10:03:02</font></td> 
       <td align="right" height="50">水利部珠江水利委员会水文局</td> 
      </tr> 
     </tbody>
    </table></td> 
  </tr> 
  <tr> 
   <td> 
    <table width="100%"> 
     <tbody>
      <tr height="20"> 
       <td width="70%" align="left"><font color="#FF0000" face="华文新魏" size="5">流域主要旱片最新墒情信息</font></td> 
       <td width="20%"></td> 
      </tr> 
     </tbody>
    </table>
    <table width="100%" border="1" cellpadding="0" cellspacing="0" style="border: 3 double;border-collapse: collapse" bordercolor="#0066FF"> 
     <tbody>
      <tr height="28" align="center" style="color: #FFFFFF"> 
       <td align="center" width="25%" background="/images/bg_head.gif">站名</td> 
       <td align="center" width="25%" background="/images/bg_head.gif">所属旱片</td> 
       <td align="center" width="25%" background="/images/bg_head.gif">时间</td> 
       <td align="center" width="25%" background="/images/bg_head.gif">平均含水量(%)</td> 
      </tr> 
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">蛮耗 </td>
       <td align="center">红河旱片</td>
       <td align="center">2017-9-1 8:00:00</td>
       <td align="center">27.4</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">南洞口 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-9-1 8:00:00</td>
       <td align="center">23.3</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">江边街 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-9-1 8:00:00</td>
       <td align="center">19.3</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">榕峰 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-9-1 8:00:00</td>
       <td align="center">25.7</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">沾益 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-9-1 8:00:00</td>
       <td align="center">27.7</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">西桥 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-9-1 8:00:00</td>
       <td align="center">22.2</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">南洞口 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-27 8:00:00</td>
       <td align="center">23.3</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">江边街 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-21 8:00:00</td>
       <td align="center">18.9</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">南洞口 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-21 8:00:00</td>
       <td align="center">23.2</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">蛮耗 </td>
       <td align="center">红河旱片</td>
       <td align="center">2017-8-21 8:00:00</td>
       <td align="center">28.6</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">西桥 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-21 8:00:00</td>
       <td align="center">22.2</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">沾益 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-21 8:00:00</td>
       <td align="center">24.9</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">榕峰 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-21 8:00:00</td>
       <td align="center">20.9</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">南洞口 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-11 8:00:00</td>
       <td align="center">23.3</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">西桥 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-11 8:00:00</td>
       <td align="center">19.1</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">沾益 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-11 8:00:00</td>
       <td align="center">28</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">榕峰 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-11 8:00:00</td>
       <td align="center">25.3</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">蛮耗 </td>
       <td align="center">红河旱片</td>
       <td align="center">2017-8-11 8:00:00</td>
       <td align="center">28.2</td>
      </tr>
      <tr height="25" bgcolor="#E3FFE3" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#E3FFE3'">
       <td align="center">江边街 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-11 8:00:00</td>
       <td align="center">17.2</td>
      </tr>
      <tr height="25" bgcolor="#FFFFFF" onmouseover="this.bgColor='#cbe7f6'" onmouseout="this.bgColor='#FFFFFF'">
       <td align="center">沾益 </td>
       <td align="center">南盘江旱片</td>
       <td align="center">2017-8-1 8:00:00</td>
       <td align="center">26.8</td> 
      </tr> 
     </tbody>
    </table> 
    <table border="0" cellpadding="0" cellspacing="0" width="100%" style="color: #0000FF"> 
     <tbody>
      <tr> 
       <td align="left"><font color="#FF0000">最近更新: 2017-9-2 10:03:02</font></td> 
       <td align="right" height="50">水资源保护局、水文局</td> 
      </tr> 
     </tbody>
    </table> </td> 
  </tr> 
 </tbody>
</table>

这篇关于用java jsoup想下载一个表格,但是网站总是进不去,求指点的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆