sapply vs. lapply while reading files and rbind'ing them

Views: 39 · Posted: 2021/7/14 20:09:19 · Tags: r, sapply, rbind, read.csv


Question

I followed Hadley's answer in the thread "Issue in Loading multiple .csv files into single dataframe in R using rbind" to read multiple CSV files and combine them into one data frame. I also experimented with lapply vs. sapply, as discussed in "Grouping functions (tapply, by, aggregate) and the *apply family".

Here's my first CSV file:

dput(File1)
structure(list(First.Name = structure(c(1L, 2L, 1L, 1L, 1L), .Label = c("A", 
"C"), class = "factor"), Last.Name = structure(c(1L, 2L, 2L, 
2L, 2L), .Label = c("B", "D"), class = "factor"), Income = c(55L, 
23L, 34L, 45L, 44L), Tax = c(23L, 21L, 22L, 24L, 25L), Location = structure(c(3L, 
3L, 1L, 4L, 2L), .Label = c("Americas", "AP", "EMEA", "LATAM"
), class = "factor")), .Names = c("First.Name", "Last.Name", 
"Income", "Tax", "Location"), class = "data.frame", row.names = c(NA, 
-5L))

Here's my second CSV file:

dput(File2)
structure(list(First.Name = structure(c(1L, 2L, 1L, 1L, 1L), .Label = c("A", 
"C"), class = "factor"), Last.Name = structure(c(1L, 2L, 2L, 
2L, 2L), .Label = c("B", "D"), class = "factor"), Income = c(55L, 
55L, 55L, 55L, 55L), Tax = c(24L, 24L, 24L, 24L, 24L), Location = structure(c(3L, 
3L, 1L, 4L, 2L), .Label = c("Americas", "AP", "EMEA", "LATAM"
), class = "factor")), .Names = c("First.Name", "Last.Name", 
"Income", "Tax", "Location"), class = "data.frame", row.names = c(NA, 
-5L))

Here's my code:

dat1 <-",First.Name,Last.Name,Income,Tax,Location\n1,A,B,55,23,EMEA\n2,C,D,23,21,EMEA\n3,A,D,34,22,Americas\n4,A,D,45,24,LATAM\n5,A,D,44,25,AP"
dat2 <-",First.Name,Last.Name,Income,Tax,Location\n1,A,B,55,24,EMEA\n2,C,D,55,24,EMEA\n3,A,D,55,24,Americas\n4,A,D,55,24,LATAM\n5,A,D,55,24,AP"

tc1 <- textConnection(dat1)
tc2 <- textConnection(dat2)

merged_file <- do.call(rbind, lapply(list(tc1,tc2), read.csv))

While this works beautifully, I wanted to change lapply to sapply. From the thread above, I understand that sapply turns the data read from the CSV files (including the factor columns) into a matrix, but I am unsure why the fields are flipped. For instance, the Income field occupies rows 3 and 8 of the output instead of sitting in a single column.

Here's the code:

tc1 <- textConnection(dat1)
tc2 <- textConnection(dat2)

# change lapply to sapply    
merged_file <- do.call(rbind, sapply(list(tc1,tc2), read.csv))

Here's the output:

      [,1] [,2] [,3] [,4] [,5]
 [1,]    1    2    1    1    1
 [2,]    1    2    2    2    2
 [3,]   55   23   34   45   44
 [4,]   23   21   22   24   25
 [5,]    3    3    1    4    2
 [6,]    1    2    1    1    1
 [7,]    1    2    2    2    2
 [8,]   55   55   55   55   55
 [9,]   24   24   24   24   24
[10,]    3    3    1    4    2

I'd appreciate any help. I am fairly new to R and not sure what's going on.
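The flipped layout described above can be reproduced and inspected with a small sketch. It is not taken from the original thread: the names `csv1`, `csv2`, and `read_one` are made up for illustration, the CSV text is a shortened three-row version of the question's data without the leading index column, and `stringsAsFactors = TRUE` is set explicitly on the assumption that the original output came from a version of R where factors were the default.

```r
# Hypothetical, shortened copies of the two CSV files (3 rows each).
csv1 <- "First.Name,Last.Name,Income,Tax,Location\nA,B,55,23,EMEA\nC,D,23,21,EMEA\nA,D,34,22,Americas"
csv2 <- "First.Name,Last.Name,Income,Tax,Location\nA,B,55,24,EMEA\nC,D,55,24,EMEA\nA,D,55,24,Americas"

# Hypothetical helper: parse CSV text, keeping strings as factors.
read_one <- function(txt) read.csv(text = txt, stringsAsFactors = TRUE)

as_list   <- lapply(list(csv1, csv2), read_one)
as_matrix <- sapply(list(csv1, csv2), read_one)

# lapply keeps each data frame intact, so rbind stacks rows as intended.
merged <- do.call(rbind, as_list)
dim(merged)

# sapply simplifies the two data frames into a 5 x 2 matrix of mode list:
# one cell per (column, file) pair, each cell holding a whole column vector.
dim(as_matrix)
is.list(as_matrix)

# do.call() treats that matrix as a plain list of 10 column vectors, so
# rbind turns every column into a row and factors become integer codes.
flipped <- do.call(rbind, as_matrix)
dim(flipped)

# With simplify = FALSE, sapply returns the same list that lapply does.
unsimplified <- sapply(list(csv1, csv2), read_one, simplify = FALSE)
identical(do.call(rbind, unsimplified), merged)
```

Under these assumptions, Income ends up in rows 3 and 8 of `flipped` for the same reason as in the question: each file contributes five column vectors, and each vector becomes one row.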
