R编程:“比列名更多的列” [英] R Programming: "More Columns than Column Names"

查看:2161
本文介绍了R编程:“比列名更多的列”的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我试图在IRS中的 R 中读取一个.csv文件,它似乎没有以任何奇怪的方式格式化,我只是使用read.table功能,因为我已经使用了一百万次。我找不到任何解决方案已经可用的这个问题,似乎帮助我。这里是我使用的代码:

I'm trying to read in a .csv file in R from the IRS and it doesn't appear to be formatted in any weird way, and I'm just using the read.table function as I've used a million times before. I can't find any solutions already available to this problem that seem to help me. Here's the code I'm using:

data_0910<-read.table("/Users/blahblahblah/countyinflow0910.csv",header=T,stringsAsFactors=FALSE,colClasses="character")

Error in read.table("/Users/blahblahblah/countyinflow0910.csv",  : 
  more columns than column names

为什么要这样做?如果有帮助,可以找到.csv文件:

Why is it doing this? If it helps, the .csv files can be found at:

http://www.irs.gov/uac/SOI-Tax-Stats-County-to-County-Migration-Data-Files

推荐答案

它使用逗号因为你可以设置 sep =,或者只是使用 read.csv

It uses commas as separators. So you can either set sep="," or just use read.csv:

x <- read.csv(file="http://www.irs.gov/file_source/pub/irs-soi/countyinflow1011.csv")
dim(x)
## [1] 113593      9

错误是由一些值中的空格和不匹配的引号引起的。标题中没有空格,因此 read.table 认为有一列。然后它认为它看到一些行中的多个列。例如,前两行(标题和第一行):

The error is caused by spaces in some of the values, and unmatched quotes. There are no spaces in the header, so read.table thinks that there is one column. Then it thinks it sees multiple columns in some of the rows. For example, the first two lines (header and first row):

State_Code_Dest,County_Code_Dest,State_Code_Origin,County_Code_Origin,State_Abbrv,County_Name,Return_Num,Exmpt_Num,Aggr_AGI
00,000,96,000,US,Total Mig - US & For,6973489,12948316,303495582

和不匹配的引号,例如第1336行(第1335行)将 read.table 与默认的引用参数混淆(但不是 read.csv ):

And unmatched quotes, for example on line 1336 (row 1335) which will confuse read.table with the default quote argument (but not read.csv):

01,089,24,033,MD,Prince George's County,13,30,1040

这篇关于R编程:“比列名更多的列”的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆