如何用另一个数据框的值替换变量的NA [英] How to replace NAs of a variable with values from another dataframe

查看：86 发布时间：2020/5/6 9:22:08 r match dplyr na

本文介绍了如何用另一个数据框的值替换变量的NA的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我希望这不是愚蠢的.

我有两个具有变量ID和性别/性别的数据框.在df1中，有NA.在df2中，变量已完成.我想用df2中的值来完成df1中的列. (在df1中，该变量称为性别".在df2中，其称为性别".)

I have two dataframes with Variables ID and gender/sex. In df1, there are NAs. In df2, the variable is complete. I want to complete the column in df1 with the values from df2. (In df1 the variable is called "gender". In df2 it is called "sex".)

这是我到目前为止尝试过的:

Here is what i tried so far:

#example-data
ID<-seq(1,30,by=1)
df1<-as.data.frame(ID)
df2<-df1
df1$gender<-c(NA,"2","1",NA,"2","2","2","2","2","2",NA,"2","1","1",NA,"2","2","2","2","2","1","2","2",NA,"2","2","2","2","2",NA)
df2$sex<-c("2","2","1","2","2","2","2","2","2","2","2","2","1","1","2","2","2","2","2","2","1","2","2","2","2","2","2","2","2","2")


#Approach 1: 
NAs.a <- is.na(df1$gender)
df1$gender[NAs.a] <- df2[match(df1$ID[NAs.a], df2$ID),]$sex

#Approach 2 (i like dplyr a lot, perhaps there´s a way to use it):
library("dplyr")
temp<-df2 %>% select(ID,gender)

#EDIT:
#df<-left_join(df1$gender,df2$gender, by="ID") 
 df<-left_join(df1,df2, by="ID")

非常感谢您.

推荐答案

下面是使用data.table s二进制连接的快速解决方案，它将使用sex仅连接 gender并保留所有其余各栏保持不变

Here's a quick solution using data.tables binary join this will join only gender with sex and leave all the rest of the columns untouched

library(data.table)
setkey(setDT(df1), ID)
df1[df2, gender := i.sex][]
#     ID gender
#  1:  1      2
#  2:  2      2
#  3:  3      1
#  4:  4      2
#  5:  5      2
#  6:  6      2
#  7:  7      2
#  8:  8      2
#  9:  9      2
# 10: 10      2
# 11: 11      2
# 12: 12      2
# 13: 13      1
# 14: 14      1
# 15: 15      2
# 16: 16      2
# 17: 17      2
# 18: 18      2
# 19: 19      2
# 20: 20      2
# 21: 21      1
# 22: 22      2
# 23: 23      2
# 24: 24      2
# 25: 25      2
# 26: 26      2
# 27: 27      2
# 28: 28      2
# 29: 29      2
# 30: 30      2

这篇关于如何用另一个数据框的值替换变量的NA的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

如何用另一个数据框的值替换变量的NA [英] How to replace NAs of a variable with values from another dataframe

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

如何用另一个数据框的值替换变量的NA [英] How to replace NAs of a variable with values from another dataframe

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭