如何将多列值合并为一列? [英] How to merge multiple columns values into one column?

查看:94
本文介绍了如何将多列值合并为一列?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有一个称为"stemmoutput"的数据框(见下文):

I have a data frame called "stemmoutput" (see below) :

     X1      X2       X3      X4      X5      X6      X7     X8     X9    X10     
1  tanaman  cabai                                    
2  banget   hama     sakit   tanaman                            
3  koramil  nogosari melaks  ecek     hama   tanaman padi    ppl    ds   rambun

我想将多个列的值合并成这样的一列:

And I want to merge multiple columns values into one column like this :

     TEXT
1  tanaman cabai                                     
2  banget hama sakit tanaman                            
3  koramil nogosari melaks ecek hama tanaman padi ppl ds rambun 

我已经尝试过此代码,并且可以正常工作

I have tried this code, and it works

stemmoutput$TEXT <- with(stemmoutput, paste(X1,X2,X3,X4,X5,X6,X7,X8,X9,X10, sep=" "))

但是还有其他更有效的方法,而不必逐一写下列名吗?

but is there any other way that is more efficient, without having to write down the name of the column one by one?

我也像下面这样尝试过此代码,但这也不起作用.

I've also tried this code like below but that didn't work either.

for(i in names(stemmoutput)){
     stemmoutput$TEXT <- with(stemmoutput, paste(i, sep=" "))}

推荐答案

尝试do.call

library(stringr)
newdat <- data.frame(TEXT=str_trim(do.call(paste, stemmoutput)),
                     stringsAsFactors=FALSE)

newdat
#                                                         TEXT
#1                                                tanaman cabai
#2                                    banget hama sakit tanaman
#3 koramil nogosari melaks ecek hama tanaman padi ppl ds rambun

如果一列中包含多个部分的单词,最好使用,作为分隔符

It may be better to use , as delimiter if there are multi-part words within a column

 TEXT <- gsub(', [^A-Za-z]+', '', do.call(paste, c(stemmoutput, sep=', ')))

 newdat <- data.frame(TEXT, stringsAsFactors=FALSE)
 newdat
 #                                                                  TEXT
 #1                                                        tanaman, cabai
 #2                                          banget, hama, sakit, tanaman
 #3 koramil, nogosari, melaks, ecek, hama, tanaman, padi, ppl, ds, rambun

这篇关于如何将多列值合并为一列?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆