Using Python to extract strings from a list of strings

Problem description

When I read the CSV header using the following code:

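import csv

def read_header(gram):
    # Python 2 style: the file is opened in binary mode and the reader's .next() is called.
    path = "Dataset/MALE_Training/" + str(gram) + "/" + str(gram) + ".csv"
    CSV_tag_read = csv.reader(open(path, "rb"), delimiter='"')
    header = CSV_tag_read.next()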

The header (the first line of the CSV file) is as follows:

Sample1"('CC', 'CD')"('CD', 'DT')"('DT', 'EX')"('EX', 'IN')"('IN', 'JJ')"('JJ', 'JJR')"('JJR', 'JJS')"('JJS', 'MD')"('MD', 'NN')"('NN', 'NNS')"('NNS', 'PRP')"('PRP', 'RB')"('RB', 'RBR')"('RBR', 'RP')"('RP', 'TO')"('TO', 'VB')"('VB', 'VBD')"('VBD', 'VBG')"('VBG', 'VBN')"('VBN', 'VBP')"('VBP', 'VBZ')"('VBZ', 'WDT')"('WDT', 'WP')"('WP', 'WRB')

I want my output to be a list containing:

['Sample1', '(CC, CD)', '(CD, DT)', '(DT, EX)', '(EX, IN)', '(IN, JJ)', '(JJ, JJR)', '(JJR, JJS)', '(JJS, MD)', '(MD, NN)', '(NN, NNS)', '(NNS, PRP)', '(PRP, RB)', '(RB, RBR)', '(RBR, RP)', '(RP, TO)', '(TO, VB)', '(VB, VBD)', '(VBD, VBG)', '(VBG, VBN)', '(VBN, VBP)', '(VBP, VBZ)', '(VBZ, WDT)', '(WDT, WP)', '(WP, WRB)']

Solution

Using ast.literal_eval:

>>> import ast
>>> a = "('TO', 'RP', 'TO')"
>>> b = ast.literal_eval(a)
>>> b
('TO', 'RP', 'TO')
>>> b[0]
'TO'

Or, with plain string methods:

>>> [x.strip("' ") for x in a.strip('()').split(',')]
['TO', 'RP', 'TO']

Updated answer, corresponding to the updated question:

import csv

with open('/path/to/csv-file.csv') as f:
    reader = csv.reader(f, delimiter='"')
    # Read the first row (the header) and strip the single quotes from each column.
    header = [column.replace("'", "") for column in next(reader)]
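The same idea can be tried without a file on disk. The sketch below is an illustration rather than the answer's own method: it swaps csv.reader for str.split on the double quote, which assumes no column contains an embedded double quote. The function name parse_header and the shortened sample line are hypothetical.

```python
def parse_header(line):
    """Split a double-quote-delimited header line and drop the single quotes."""
    # Remove the trailing newline, then split on the '"' separator.
    columns = line.rstrip("\r\n").split('"')
    return [column.replace("'", "") for column in columns]

# Shortened version of the header line from the question (assumed sample data).
sample_line = "Sample1\"('CC', 'CD')\"('CD', 'DT')\"('DT', 'EX')\n"
header = parse_header(sample_line)
print(header)
```

Splitting by hand keeps the example independent of how the csv module treats a delimiter that coincides with its default quote character.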
