将数据从BeautifulSoup导出到CSV [英] Export data from BeautifulSoup to CSV

查看:294
本文介绍了将数据从BeautifulSoup导出到CSV的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

[免责声明]我已经在该地区找到了许多其他答案,但它们似乎对我没有用.

[DISCLAIMER] I have been through plenty of the other answers on the area, but they do not seem to work for me.

我希望能够将我抓取的数据导出为CSV文件.

I want to be able to export the data I have scraped as a CSV file.

我的问题是如何编写将数据输出到CSV的代码?

My question is how do I write the piece of code which outputs the data to a CSV?

当前代码

import requests
from bs4 import BeautifulSoup 

url = "http://implementconsultinggroup.com/career/#/6257"
r = requests.get(url)

req = requests.get(url).text
soup = BeautifulSoup(r.content)
links = soup.find_all("a")

for link in links:
     if "career" in link.get("href") and 'COPENHAGEN' in link.text:
             print "<a href='%s'>%s</a>" %(link.get("href"), link.text)

代码输出

View Position

</a>
<a href='/career/management-consultants-to-help-our-customers-succeed-with-
it/'>
Management consultants to help our customers succeed with IT
COPENHAGEN • At Implement Consulting Group, we wish to make a difference in 
the consulting industry, because we believe that the ability to create Change 
with Impact is a precondition for success in an increasingly global and 
turbulent world.




View Position

</a>
<a href='/career/management-consultants-within-process-improvement/'>
Management consultants within process improvement
COPENHAGEN • We are looking for consultants with profound
experience in Six Sigma, Lean and operational
management

我尝试过的代码

with open('ImplementTest1.csv',"w") as csv_file:
     writer = csv.writer(csv_file)
     writer.writerow(["link.get", "link.text"])
     csv_file.close()

以CSV格式输出

第1列:网址链接

第2列:职位描述

例如

第1列:/职业/管理顾问,以帮助我们的客户成功完成- 它/

Column 1: /career/management-consultants-to-help-our-customers-succeed-with- it/

第2栏:管理顾问,以帮助我们的客户在IT方面取得成功 哥本哈根•在实施咨询集团,我们希望在以下方面有所作为 咨询行业,因为我们相信创造变革的能力 拥有影响力是在日益全球化和全球化中取得成功的前提 动荡的世界.

Column 2: Management consultants to help our customers succeed with IT COPENHAGEN • At Implement Consulting Group, we wish to make a difference in the consulting industry, because we believe that the ability to create Change with Impact is a precondition for success in an increasingly global and turbulent world.

推荐答案

尝试以下脚本并获取csv输出:

Try this script and get the csv output:

import csv ; import requests
from bs4 import BeautifulSoup 

outfile = open('career.csv','w', newline='')
writer = csv.writer(outfile)
writer.writerow(["job_link", "job_desc"])

res = requests.get("http://implementconsultinggroup.com/career/#/6257").text
soup = BeautifulSoup(res,"lxml")
links = soup.find_all("a")

for link in links:
     if "career" in link.get("href") and 'COPENHAGEN' in link.text:
        item_link = link.get("href").strip()
        item_text = link.text.replace("View Position","").strip()
        writer.writerow([item_link, item_text])
        print(item_link, item_text)
outfile.close()

这篇关于将数据从BeautifulSoup导出到CSV的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆