如何在 pandas 中读取固定宽度格式的文本文件 [英] How do I read a fix width format text file in pandas
问题描述
我刚接触到熊猫,正在弄清楚如何读取文件.该文件来自WRDS数据库,并且可以追溯到1960年代的SP500组成部分.我检查了文件,无论我如何使用"read_csv"将其导入,我仍然无法正确显示数据.
I just got my hands on pandas and am figuring out how I can read a file. The file is from WRDS database and is the SP500 constituents list all the way back to 1960s. I checked the file and no matter what I do to import it using 'read_csv', i still cant display the data correctly.
df = read_csv('sp500-sb.txt')
df
<class 'pandas.core.frame.DataFrame'>
Int64Index: 1231 entries, 0 to 1230
Data columns: gvkeyx from thru conm
gvkey co_conm
...(the column names)
dtypes: object(1)
以上输出内容是什么意思?一切都会有帮助的
What does the above chunk of output mean? Anything would be helpful
推荐答案
我们通过电子邮件回复了我.干杯.
Wes answered me in an email. Cheers.
这是一个固定宽度格式的文件(不以逗号或制表符分隔, 通常).我意识到熊猫没有像这样的定宽阅读器 R可以,尽管可以很容易地完成.我会尽力而为 做.同时,如果您可以导出其他格式的数据(例如 csv-用逗号分隔),您可以使用read_csv读取它.一世 怀疑您是否拥有一些Unix魔术,可以将FWF文件转换为CSV文件 文件.
This is a fixed-width-format file (not delimited by commas or tabs as usual). I realize that pandas does not have a fixed-width reader like R does, though one can be fashioned very easily. I'll see what I can do. In the meantime if you can export the data in another format (like csv--truly comma separated) you'll be able to read it with read_csv. I suspect with some unix magic you can transform a FWF file into a CSV file.
我建议您在github上关注该问题,因为您的电子邮件即将 从我的收件箱中消失:)
I recommend following the issue on github as your e-mail is about to disappear from my inbox :)
https://github.com/pydata/pandas/issues/920
最好, 韦斯
这篇关于如何在 pandas 中读取固定宽度格式的文本文件的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!