使用Python *刮内容的.aspx [英] Scraping *.aspx content using Python

查看:272
本文介绍了使用Python *刮内容的.aspx的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有在刮ASPX动态生成的表的困难。试图从一个站点刮天然气价格这样 GasPrices 。我可以提取在天然气价格表(地址,提交时间等)的所有信息,除了实际的天然气价格。

I'm having difficulties scraping dynamically generated table in ASPX. Trying to scrape the gas prices from a site like this GasPrices. I can extract all the information in the gas price table (address, time submitted etc.), except for the actual gas price.

有没有一种方法,我可以凑天然气价格?即以某种方式得到它的文本再presentation。我不是很熟悉ASP / ASPX - 但就是生成的内容,现在没有出现在最终的HTML起来。我使用Python做刮痧,但是这是不相关的,除非有一个特定的库...

Is there a way I could scrape the gas prices? i.e. somehow get a text representation of it. I'm not very familiar with ASP/ASPX - but what's being generated now is not showing up in the final HTML. I'm using Python to do the scraping, but that's irrelevant unless there's a specific library...

先谢谢了。

推荐答案

<击>页(CSS)的起源不是一个问题在这里。

The origin of the page (aspx) is not an issue here.

看起来他们正在积极试图阻挠刮尝试。这些数字是不是字体,而他们​​几个div元素旁边的背景图像彼此是数字。的他们真的不希望被刮掉。

It looks like they're actively trying to thwart scraping attempts. The numbers are not fonts, rather they several div elements next to one another with background images that are numbers. They really don't want to be scraped.

(当然,如果你真的确定你很可能映射div的类名......他们不是很好'加密')

(of course, if you were really determined you could probably map the class name of the div to... They're not very well 'encrypted')

记的版权声明的链接页面底部

这篇关于使用Python *刮内容的.aspx的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆