删除重复的行 [英] Remove duplicated rows
本文介绍了删除重复的行的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
我已将 CSV
文件读入R data.frame。其中一列在其中一列具有相同的元素。我想删除该列中重复的行。例如:
I have read a CSV
file into an R data.frame. Some of the rows have the same element in one of the columns. I would like to remove rows that are duplicates in that column. For example:
platform_external_dbus 202 16 google 1
platform_external_dbus 202 16 space-ghost.verbum 1
platform_external_dbus 202 16 localhost 1
platform_external_dbus 202 16 users.sourceforge 8
platform_external_dbus 202 16 hughsie 1
我想只有其中一行,因为其他行在第一列中有相同的数据。
I would like only one of these rows since the others have the same data in the first column.
推荐答案
只是隔离您的数据框架到您需要的列,然后使用唯一的功能:D
just isolate your data frame to the columns you need, then use the unique function :D
# in the above example, you only need the first three columns
deduped.data <- unique( yourdata[ , 1:3 ] )
# the fourth column no longer 'distinguishes' them,
# so they're duplicates and thrown out.
这篇关于删除重复的行的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文