Loading data issues
Problem description
Data link: Data
Code:
ccfsisims <- read.csv(file = "F:/Purdue University/RA_Position/PhD_ResearchandDissert/PhD_Draft/GTAP-CGE/GTAP_NewAggDatabase/NewFiles/GTAP_ConsIndex.csv",
                      header=TRUE, sep=",", na.string="NA", dec=".", strip.white=TRUE)
ccfsirsts <- as.data.frame(ccfsisims)
ccfsirsts[7:25] <- sapply(ccfsirsts[7:25],as.numeric)
ccfsirsts <- droplevels(ccfsirsts)
ccfsirsts <- transform(ccfsirsts,sres=factor(sres,levels=unique(sres)))
ccfsirsts[1:5,]
Issue:
So, if you check the column "pSVIPM", the values displayed in the data frame "ccfsirsts" are different from what is actually saved in the .csv file. This problem occurred when loading a different set of data.
In the initial load, i.e. "ccfsisims", everything seems to check out. It is afterward that the problem occurs.
Any thoughts on why this happens?
Solution
When you load ccfsisims, do str(ccfsisims) (get in the habit of doing this) and you will see that pSVIPM is a factor. So as.numeric will simply replace each value with its integer level code, numbered in the order the levels appear, rather than with the number written in the file. The column was read as a factor because, if you look at your csv, you have #DIV/0! strings in there. Try it yourself:
> length(ccfsisims$pSVIPM[ccfsisims$pSVIPM == "#DIV/0!"])
[1] 350
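One way to see the effect and work around it is sketched below. The three-row csv_text is made-up stand-in data, not the asker's file, and stringsAsFactors = TRUE is set explicitly to mimic the behaviour described above (it was the default before R 4.0). Two common fixes are shown: converting through as.character first, or telling read.csv that "#DIV/0!" means missing.

```r
# Illustrative stand-in for the real CSV (hypothetical values).
csv_text <- "id,pSVIPM\n1,0.25\n2,#DIV/0!\n3,1.5\n"

# Reproduce the problem: the "#DIV/0!" string forces the column to be a factor.
bad <- read.csv(text = csv_text, stringsAsFactors = TRUE)
str(bad)
wrong <- as.numeric(bad$pSVIPM)  # integer level codes, not the file's values

# Fix 1: convert via character; "#DIV/0!" becomes NA (R warns about coercion).
fixed <- suppressWarnings(as.numeric(as.character(bad$pSVIPM)))

# Fix 2: declare the spreadsheet error string as missing when reading.
good <- read.csv(text = csv_text, na.strings = c("NA", "#DIV/0!"))
str(good)
```

With the second approach the column arrives as numeric, so the later sapply(..., as.numeric) step in the question would no longer scramble the values.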