How to get the JavaScript-rendered HTML source using Selenium
Question
I run a query on a web page and get a result URL. If I right-click and view the HTML source in the browser, I can see the HTML generated by JS. If I simply use urllib, Python cannot get the JS-generated code. I have seen some solutions that use Selenium, so I tried that. Here's my code:
from selenium import webdriver

url = 'http://www.archives.com/member/Default.aspx?_act=VitalSearchResult&lastName=Smith&state=UT&country=US&deathYear=2004&deathYearSpan=10&location=UT&activityID=9b79d578-b2a7-4665-9021-b104999cf031&RecordType=2'
# Raw string so the backslashes in the Windows path are never read as escapes
driver = webdriver.PhantomJS(executable_path=r'C:\python27\scripts\phantomjs.exe')
driver.get(url)
print(driver.page_source)
>>> <html><head></head><body></body></html>
Obviously, that is not right!
Here is the source I need, as shown in the browser's right-click "view source" window (I want the INFORMATION part):
</script></div><div class="searchColRight"><div id="topActions" class="clearfix
noPrint"><div id="breadcrumbs" class="left"><a title="Results Summary"
href="Default.aspx? _act=VitalSearchR ...... <<INFORMATION I NEED>> ...
to view the entire record.</p></div><script xmlns:msxsl="urn:schemas-microsoft-com:xslt">
jQuery(document).ready(function() {
jQuery(".ancestry-information-tooltip").actooltip({
href: "#AncestryInformationTooltip", orientation: "bottomleft"});
});
So my question is: how do I get the information generated by JS?
Recommended answer
You will need to get the document via JavaScript. You can use Selenium's execute_script function:
from time import sleep  # this should go at the top of the file

sleep(5)  # crude fixed wait to give the page's JavaScript time to run
html = driver.execute_script("return document.getElementsByTagName('html')[0].innerHTML")
print(html)
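This returns everything inside the <html> tag, read from the live DOM rather than from the original server response.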
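A fixed sleep(5) either wastes time or is too short on a slow page. As a rough sketch (not part of the original answer), the same execute_script call can be polled until some text you expect in the rendered page shows up. The helper name wait_for_rendered_html, its parameters, and the choice of 'searchColRight' as a marker (taken from the class name in the question's snippet) are assumptions made for illustration; the only Selenium method it relies on is execute_script.

```python
import time


def wait_for_rendered_html(driver, marker, timeout=10.0, poll_interval=0.5):
    """Poll the live DOM until `marker` appears, then return the HTML.

    `driver` is any object exposing Selenium's execute_script method.
    `marker` is a substring (hypothetical, chosen by the caller) that is
    expected to appear only once the JavaScript has rendered the page.
    """
    script = "return document.getElementsByTagName('html')[0].innerHTML"
    deadline = time.time() + timeout
    while True:
        html = driver.execute_script(script)
        if html and marker in html:
            return html
        if time.time() >= deadline:
            raise RuntimeError(
                "marker %r not found within %s seconds" % (marker, timeout)
            )
        time.sleep(poll_interval)
```

With the driver from the question, this would be called as html = wait_for_rendered_html(driver, 'searchColRight') in place of the sleep and execute_script lines. Selenium also ships its own explicit-wait helpers, which may be a better fit if you can wait on a specific element rather than on a substring.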