Failed to read inch symbol in pandas read_csv


Problem description

I have a CSV with the details below:

Name,Desc,Year,Location

Jhon,12" Main Third ,2012,GR

Lew,"291" Line (12,596,3)",2012,GR

,All, 1992,FR

...

It is a very long file; I have shown only the problematic lines. I am confused about how to read it into a pandas DataFrame. I tried the quotechar, quoting, and sep parameters of pandas read_csv, still with no success.

I have no control over how the CSV is designed.

Recommended answer

You can do something like this. See whether it works for you:

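import re
import pandas as pd

# Name, Desc, Year, Location. Desc is matched greedily, so any quotes
# and commas inside it stay in that one column.
row_pattern = re.compile(r"^(\w*),(.*),\s?(\d+),(\w+)")

rows = []
with open('/home/yusuf/Desktop/c1') as f:
    headers = f.readline().strip('\n').split(',')
    for line in f:
        match = row_pattern.match(line)
        if match:  # lines that do not fit the pattern are skipped
            rows.append(match.groups())

df = pd.DataFrame(data=rows, columns=headers)
df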


Regex demo: https://regex101.com/r/AU2WcO/1
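To see how the pattern splits the three problematic lines from the question, here is a minimal self-contained sketch. It uses only the standard library, with io.StringIO standing in for the real file. The helper name split_rows and the skipped list are assumptions added for illustration; they are not part of the original answer.

```python
import io
import re

# Sample rows copied from the question; StringIO stands in for the real file.
SAMPLE = '''Name,Desc,Year,Location
Jhon,12" Main Third ,2012,GR
Lew,"291" Line (12,596,3)",2012,GR
,All, 1992,FR
'''

# Same pattern as in the answer: Name, Desc (greedy), Year, Location.
ROW_PATTERN = re.compile(r"^(\w*),(.*),\s?(\d+),(\w+)")


def split_rows(handle):
    """Hypothetical helper: return (headers, rows, skipped) for a file-like object."""
    headers = handle.readline().rstrip("\n").split(",")
    rows, skipped = [], []
    for line in handle:
        match = ROW_PATTERN.match(line)
        if match:
            rows.append(match.groups())
        elif line.strip():
            # Keep non-matching, non-blank lines so they can be inspected later.
            skipped.append(line)
    return headers, rows, skipped


headers, rows, skipped = split_rows(io.StringIO(SAMPLE))
for row in rows:
    print(row)
```

The rows can then be passed to pd.DataFrame(rows, columns=headers) as in the answer. Because the Desc group is greedy, the last two comma-separated fields on each line are always taken as Year and Location, which is why the embedded inch marks and commas do not break the split. This only holds if Name and Location contain nothing but word characters and Year is all digits; lines that do not fit are collected in skipped rather than silently dropped.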
