根据两列的字符串长度选择行 [英] Selecting rows on basis of the string length of two columns

查看：169 发布时间：2020/7/6 20:13:44 r select dataframe row

本文介绍了根据两列的字符串长度选择行的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我想选择一个数据帧的行，其中第v3列中的字符串的长度等于第v4列中的字符串的长度. 我的数据框"df"看起来像:

I want to select the rows of a data frame in which the length of the string in the column v3 is equal to the length of the string of the column v4. My dataframe 'df' looks like:

    v1  v2  v3  v4
1   456 .   C   T
2   462 .   C   T
3   497 .   C   T
4   499 .   GC  AC
5   499 .   GC  G
6   499 .   GC  CC
7   513 .   GCACA   GCA
8   513 .   GCACA   GCACACA
9   513 .   GCACA   ACACA
10  513 .   GCACA   GCACACACA
11  513 .   GCACA   GCACACACACA

12  513 .   GCACA   GACCACA
13  513 .   GCACA   G
14  521 .   ACN A
15  522 .   CNN C

输出应为:

v1  v2  v3  v4
1   456 .   C   T
2   462 .   C   T
3   497 .   C   T
4   499 .   GC  AC
9   513 .   GCACA   ACACA

我已经尝试过:
new_df = df[nchar(str_sub(df$v3))==nchar(str_sub(df$v4))]

推荐答案

@agstudy是最重要的部分.我要补充一点，str_sub(我假设来自stringr包)在这里没有做任何有用的事情.最后，您可以使用subset来避免重复使用df$.因此，您可以这样做:

@agstudy got the most important part. I would add that str_sub (from the stringr package I assume) is not doing anything useful here. Last, you could use subset to avoid the repetitive use of df$. So you can do:

df[nchar(df$v3) == nchar(df$v4), ]

或

subset(df, nchar(v3) == nchar(v4))

这篇关于根据两列的字符串长度选择行的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

根据两列的字符串长度选择行 [英] Selecting rows on basis of the string length of two columns

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

根据两列的字符串长度选择行 [英] Selecting rows on basis of the string length of two columns

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭