无法读取 pandas 中的英寸符号read_csv [英] failed to read inch symbol in pandas read_csv
本文介绍了无法读取 pandas 中的英寸符号read_csv的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
我的csv具有以下详细信息
I have csv with below details
Name,Desc,Year,Location
Jhon,12" Main Third ,2012,GR
Lew,"291" Line (12,596,3)",2012,GR
,All, 1992,FR
...
这是很长的文件.我只是显示有问题的台词.我很困惑如何尝试在Pandas数据框中读取它
It is very long file. i just showed problematic lines.I am confused how can i read it in Pandas data frame, I tried
-
quotechar,
quotechar,
引用,
9月
like属性. 仍然没有成功.
like attribute of pandas read_csv. Still no success.
我无法控制csv的设计方式.
I have no control on how csv is being designed.
推荐答案
您可以执行以下操作.尝试一下是否适合您:
You can do something like this. Try if this works for you:
import pandas as pd
import re
l1=[]
with open('/home/yusuf/Desktop/c1') as f:
headers = f.readline().strip('\n').split(',')
for a in f.readlines():
if a:
q = re.findall("^(\w*),(.*),\s?(\d+),(\w+)",a)
if q:
l1.append(q)
l2 = [list(b[0]) for b in l1]
df = pd.DataFrame(data=l2, columns=headers)
df
输出:
Regex演示: https://regex101.com/r/AU2WcO/1
这篇关于无法读取 pandas 中的英寸符号read_csv的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文