如何使用Selenium Python从reddit.com搜索页面上的问题中提取标题和href属性 [英] How to extract the title and href attributes from the questions on reddit.com search page using Selenium Python

查看：92 发布时间：2020/7/27 22:04:03 python selenium selenium-webdriver webdriver webdriverwait

本文介绍了如何使用Selenium Python从reddit.com搜索页面上的问题中提取标题和href属性的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我要在页面上刮所有问题的链接和标题

I want to scrape the link and title of all the questions on the page https://www.reddit.com/search?q=Expiration&type=link&sort=new. An element has the following structure:

<a data-click-id="body" class="SQnoC3ObvgnGjWt90zD9Z" href="/r/excel/comments/ayiahc/calculating_expiration_dates_previous_solution_no/">
    <h2 class="s1okktje-0 cDxKta">
        <span style="font-weight:normal">Calculating Expiration Dates - Previous Solution No Longer Works</span>
    </h2>
</a>

我使用questions = driver.find_elements_by_xpath('//a[@data-click-id="body"]')来获取问题，然后通过for对其进行迭代.而且我很高兴使用question.get_attribute('href')来获取链接.

I use questions = driver.find_elements_by_xpath('//a[@data-click-id="body"]') to get the questions then iterate them by for. And I coud use question.get_attribute('href') to get the link.

但是，我不知道如何从question中提取span中的标题.

However, I don't know how to extract the title inside the span (from a question).

有人知道该怎么做吗?

如何使用Selenium Python从reddit.com搜索页面上的问题中提取标题和href属性 [英] How to extract the title and href attributes from the questions on reddit.com search page using Selenium Python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

如何使用Selenium Python从reddit.com搜索页面上的问题中提取标题和href属性 [英] How to extract the title and href attributes from the questions on reddit.com search page using Selenium Python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭