使用 Selenium 和 python 抓取 [英] Scraping using Selenium and python

查看:41
本文介绍了使用 Selenium 和 python 抓取的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在尝试获取诸如 14,401 之类的数字.我试过类似

I am trying to get numbers such as 14,401. I tried something like

WebDriverWait(driver, 10).until(EC.frame_to_be_available_and_switch_to_it((By.XPATH,"//iframe[@id='wiz-iframe-intent']")))
WebDriverWait(driver, 10).until(EC.element_to_be_clickable((By.XPATH, '//*[@id="highcharts-5tawcvw-0"]/svg/g[8]/g[1]/text/tspan[1]'))).text

获取超时异常.

推荐答案

那个特定元素在 SVG 中.你可以按照这个代码:

That particular element is in SVG. you can follow this code :

from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC 
from selenium.webdriver.common.action_chains import ActionChains
from selenium.webdriver.support.select import Select
import time

driver = webdriver.Chrome(executable_path = r'D:/Automation/chromedriver.exe')
driver.maximize_window()
driver.get("https://eu1.dashboard.clevertap.com/login.html")

wait = WebDriverWait(driver, 20)
action = ActionChains(driver)
wait.until(EC.frame_to_be_available_and_switch_to_it((By.ID,"wiz-iframe-intent")))
wait.until(EC.element_to_be_clickable((By.XPATH, "//div[@class='CT_Interstitial']//span[@class='CT_InterstitialClose']"))).click()

driver.switch_to.default_content()

wait.until(EC.element_to_be_clickable((By.NAME, "email"))).send_keys("username")

wait.until(EC.element_to_be_clickable((By.NAME,"password"))).send_keys("****")  

wait.until(EC.element_to_be_clickable((By.ID,"submitBtn"))).click()

wait.until(EC.visibility_of_element_located((By.CSS_SELECTOR, 'div.ct-breadcrumb')))

try:
    wait.until(EC.frame_to_be_available_and_switch_to_it((By.ID,"wiz-iframe-intent")))
    wait.until(EC.element_to_be_clickable((By.CSS_SELECTOR, "div.close-btn.js-close-popup>button"))).click()
except:
    pass

driver.switch_to.default_content()  
action.move_to_element(driver.find_element_by_css_selector("div.sidebar__brand+ul>li:first-child>a")).perform()

wait.until(EC.visibility_of_element_located((By.LINK_TEXT, "All Boards")))  

wait.until(EC.element_to_be_clickable((By.LINK_TEXT,"All Boards"))).click()

wait.until(EC.element_to_be_clickable((By.LINK_TEXT,"pe-funnel"))).click()

wait.until(EC.visibility_of_element_located((By.XPATH, "//*[name()='tspan' and contains(@stroke-linejoin,'round')]"))) 

all_data = driver.find_elements_by_xpath("//*[name()='tspan' and contains(@stroke-linejoin,'round')]")

print(len(all_data))

for data in all_data:
 print(data.text)

这篇关于使用 Selenium 和 python 抓取的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆