使用 Selenium 和 python 抓取 [英] Scraping using Selenium and python
本文介绍了使用 Selenium 和 python 抓取的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
我正在尝试获取诸如 14,401 之类的数字.我试过类似
I am trying to get numbers such as 14,401. I tried something like
WebDriverWait(driver, 10).until(EC.frame_to_be_available_and_switch_to_it((By.XPATH,"//iframe[@id='wiz-iframe-intent']")))
WebDriverWait(driver, 10).until(EC.element_to_be_clickable((By.XPATH, '//*[@id="highcharts-5tawcvw-0"]/svg/g[8]/g[1]/text/tspan[1]'))).text
获取超时异常.
推荐答案
那个特定元素在 SVG 中.你可以按照这个代码:
That particular element is in SVG. you can follow this code :
from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.action_chains import ActionChains
from selenium.webdriver.support.select import Select
import time
driver = webdriver.Chrome(executable_path = r'D:/Automation/chromedriver.exe')
driver.maximize_window()
driver.get("https://eu1.dashboard.clevertap.com/login.html")
wait = WebDriverWait(driver, 20)
action = ActionChains(driver)
wait.until(EC.frame_to_be_available_and_switch_to_it((By.ID,"wiz-iframe-intent")))
wait.until(EC.element_to_be_clickable((By.XPATH, "//div[@class='CT_Interstitial']//span[@class='CT_InterstitialClose']"))).click()
driver.switch_to.default_content()
wait.until(EC.element_to_be_clickable((By.NAME, "email"))).send_keys("username")
wait.until(EC.element_to_be_clickable((By.NAME,"password"))).send_keys("****")
wait.until(EC.element_to_be_clickable((By.ID,"submitBtn"))).click()
wait.until(EC.visibility_of_element_located((By.CSS_SELECTOR, 'div.ct-breadcrumb')))
try:
wait.until(EC.frame_to_be_available_and_switch_to_it((By.ID,"wiz-iframe-intent")))
wait.until(EC.element_to_be_clickable((By.CSS_SELECTOR, "div.close-btn.js-close-popup>button"))).click()
except:
pass
driver.switch_to.default_content()
action.move_to_element(driver.find_element_by_css_selector("div.sidebar__brand+ul>li:first-child>a")).perform()
wait.until(EC.visibility_of_element_located((By.LINK_TEXT, "All Boards")))
wait.until(EC.element_to_be_clickable((By.LINK_TEXT,"All Boards"))).click()
wait.until(EC.element_to_be_clickable((By.LINK_TEXT,"pe-funnel"))).click()
wait.until(EC.visibility_of_element_located((By.XPATH, "//*[name()='tspan' and contains(@stroke-linejoin,'round')]")))
all_data = driver.find_elements_by_xpath("//*[name()='tspan' and contains(@stroke-linejoin,'round')]")
print(len(all_data))
for data in all_data:
print(data.text)
这篇关于使用 Selenium 和 python 抓取的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文