Duplicate values in a single row in a data frame
Question
df <- data.frame(label = c("a","b","c"),
val=c("x","b","c"),
val1=c("z","b","d"))
label val val1
1 a x z
2 b b b
3 c c d
I want to find the duplicate values in each row. In the 1st row there is no duplicate; in the 2nd row, "b" is duplicated; in the 3rd row, "c" is duplicated. How can I find these duplicates in R?
I also need to replace the duplicate elements with NA.
Recommended answer
Use duplicated with apply, applied row by row (MARGIN = 1):
apply(df,1,duplicated)
[,1] [,2] [,3]
[1,] FALSE FALSE FALSE
[2,] FALSE TRUE TRUE
[3,] FALSE TRUE FALSE
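Note that apply() over rows returns each row's result as a column, so the matrix above is transposed relative to df. That is easy to miss here because df is 3 x 3. A small sketch with a hypothetical 2 x 3 data frame (df2 is an illustration, not part of the original question) makes the orientation visible:

```r
# Hypothetical 2-row, 3-column data frame, used only to show the orientation
df2 <- data.frame(label = c("a", "b"),
                  val   = c("x", "b"),
                  val1  = c("z", "b"),
                  stringsAsFactors = FALSE)

# duplicated() is called once per row; each result becomes one COLUMN
dup <- apply(df2, 1, duplicated)

dim(dup)     # 3 2: one column per row of df2
dim(t(dup))  # 2 3: same shape as df2 after transposing
```

This is why the result has to be passed through t() before it can be used as a mask on the original data frame.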
Then transpose the result with t() so that it lines up with df, and replace the flagged cells with NA:
df[t(apply(df,1,duplicated))]=NA
df
label val val1
1 a x z
2 b <NA> <NA>
3 c <NA> d
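The same idea can be wrapped in a small reusable function. This is a minimal sketch, assuming a data frame with at least two columns that are all character; the name na_row_duplicates is hypothetical and not part of the original answer. Keep in mind that apply() first converts the data frame to a matrix, so columns of mixed types are coerced to a common type before they are compared.

```r
# Hypothetical helper: replace any value that repeats an earlier value
# in the same row with NA. The first occurrence in each row is kept.
na_row_duplicates <- function(data) {
  # apply() returns one column per input row, so transpose back
  dup <- t(apply(data, 1, duplicated))
  data[dup] <- NA
  data
}

df <- data.frame(label = c("a", "b", "c"),
                 val   = c("x", "b", "c"),
                 val1  = c("z", "b", "d"),
                 stringsAsFactors = FALSE)

result <- na_row_duplicates(df)
result
```

Because the function returns a modified copy, the original df is left unchanged, which makes it easy to compare the two.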