Renaming duplicate strings in R

Views: 149 | Published: 2017/7/20 23:52:52 | Tags: r, duplicates, rename

Question

I have an R data frame with two columns of strings. One of the columns (say, Column1) contains duplicate values. I need to relabel that column so that the duplicated strings are renamed with ordered suffixes, as in Column1.new:

 Column1   Column2   Column1.new
 1         A         1_1
 1         B         1_2
 2         C         2_1
 2         D         2_2
 3         E         3
 4         F         4

Any ideas of how to do this would be appreciated.

Cheers,

Antti

Solution

Let's say your data (ordered by Column1) is in an object called tab. First create a run-length encoding object with rle():

c1.rle <- rle(tab$Column1)
c1.rle
##lengths: int [1:4] 2 2 1 1
##values : int [1:4] 1 2 3 4

That gives you the values of Column1 and the number of times each one appears. Then use that information to create the new column with unique identifiers:

tab$Column1.new <- paste0(rep(c1.rle$values, times = c1.rle$lengths), "_",
        unlist(lapply(c1.rle$lengths, seq_len)))
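
Note that the code above appends a suffix to every value, so the single occurrences become 3_1 and 4_1 rather than the plain 3 and 4 shown in the question. A minimal sketch of one way to leave unique values untouched follows. It assumes the sample data from the question, with Column1 stored as character strings; run.size and run.index are illustrative helper names, not part of the original answer.

```r
# Sample data from the question (Column1 kept as character strings)
tab <- data.frame(
  Column1 = c("1", "1", "2", "2", "3", "4"),
  Column2 = c("A", "B", "C", "D", "E", "F"),
  stringsAsFactors = FALSE
)

# rle() counts consecutive runs, so the rows must be sorted by Column1
tab <- tab[order(tab$Column1), ]
c1.rle <- rle(tab$Column1)

# Size of the run each row belongs to, and the row's position within that run
run.size  <- rep(c1.rle$lengths, times = c1.rle$lengths)
run.index <- sequence(c1.rle$lengths)

# Append "_<index>" only where the value occurs more than once
tab$Column1.new <- ifelse(run.size > 1,
                          paste0(tab$Column1, "_", run.index),
                          tab$Column1)
tab$Column1.new
```

Here sequence() does the same job as unlist(lapply(c1.rle$lengths, seq_len)) in the answer, and ifelse() keeps the original string wherever the run length is 1.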

Not sure if this is appropriate in your situation, but you could also just paste Column1 and Column2 together to create a unique identifier.
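
The paste-based alternative could look roughly like this. It is only a sketch on the question's sample data; the column name id and the "_" separator are assumptions chosen for illustration, and the result is unique only if no Column1/Column2 pair repeats.

```r
# Sample data from the question
tab <- data.frame(
  Column1 = c("1", "1", "2", "2", "3", "4"),
  Column2 = c("A", "B", "C", "D", "E", "F"),
  stringsAsFactors = FALSE
)

# Combine the two columns into a single identifier, e.g. "1_A"
tab$id <- paste(tab$Column1, tab$Column2, sep = "_")

# anyDuplicated() returns 0 when every identifier is unique
anyDuplicated(tab$id)
```

Unlike the rle() approach, this does not require the rows to be sorted by Column1.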
