R脚本:在data.table中使用shift函数时-错误:(列表)对象不能被强制键入"double" [英] R script: While using shift function in data.table - errror: (list) object cannot be coerced to type 'double'

查看:1495
本文介绍了R脚本:在data.table中使用shift函数时-错误:(列表)对象不能被强制键入"double"的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有data.table

I have data.table

set.seed(1)
dat <- data.table(Shift = c(c(0,0,0,1,2,1,1)),
              Value = rnorm(7),I.Value = rnorm(7))
dat
Shift      Value    I.Value
 0       -0.6264538  0.7383247
 0        0.1836433  0.5757814
 0       -0.8356286 -0.3053884
 1        1.5952808  1.5117812
 2        0.3295078  0.3898432
 1       -0.8204684 -0.6212406
 1        0.4874291 -2.2146999

我希望新列为shift(Value,Shift,fill = 0).因此结果应该是-

I want the new column to be shift(Value,Shift,fill=0). Hence the result should be-

  Shift      Value    I.Value    new.value   new.I.value
   0       -0.6264538  0.7383247  -0.6264538  0.7383247
   0        0.1836433  0.5757814   0.1836433  0.5757814
   0       -0.8356286 -0.3053884   -0.8356286 -0.3053884
   1        1.5952808  1.5117812    0           0
   2        0.3295078  0.3898432   1.5952808  1.5117812
   1       -0.8204684 -0.6212406    0           0
   1        0.4874291 -2.2146999   0.3295078  0.3898432

我到目前为止所拥有的:

What I have till now:

dat[,`:=` (new.value= shift(Value,Shift,fill = 0),
         new.I.value=shift(I.Value,Shift,fill = 0))]

但是shift函数返回一个列表,因为Shift参数的长度不是1,而是整个向量.因此,我遇到了错误:

But the shift function is returning a list as the parameter Shift is not of length 1, but the entire vector. Hence I face the error:

Error in `[.data.table`(dat[Shift == 0, `:=`(new = Value)], !Shift ==  : 
(list) object cannot be coerced to type 'double'

我尝试传递Shift [.I],但是也没有用.

I tried passing Shift[.I], but did not work either.

如何仅传递Shift列的行值,以便返回向量并获得正确的结果? 由于我的工作数据集非常庞大,因此我更喜欢使用data.table.

How do i pass just the row value of the column Shift, such that a vector is returned and I get the proper result? As my working dataset is pretty huge, I prefer working with data.table.

-上面函数生成的输出是-

-The output generated by the above function is -

   Shift       Value    I.Value grp       new2       new3
1:     0 -1.22461261  1.7672873   1 -1.2246126  1.7672873
2:     0 -0.47340064  0.7167075   1 -0.4734006  0.7167075
3:     0 -0.62036668  0.9101742   1 -0.6203667  0.9101742
4:     1  0.04211587  0.3841854   2  0.0000000  0.0000000
5:     2 -0.91092165  1.6821761   3  0.0000000  0.0000000
6:     1  0.15802877 -0.6357365   4  0.0000000  0.0000000
7:     1 -0.65458464 -0.4616447   4  0.1580288 -0.6357365

第4行应作为dat$Shift[4]==1移至第5行,第5行应作为dat$Shift[5]==2移至第7行.如果我们只是写-

4th row should have been shifted to 5th as dat$Shift[4]==1 and 5th row should have been shifted to 7th as dat$Shift[5]==2. If we simply write-

dat[,new:=shift(Value,2,fill=0),new.I:=shift(I.Value,2,fill=0)]

将所有行向下移动2个,因为n = 2,而不是硬编码值,n必须是当前正在处理的行的"Shift"列中存在的值.

will shift all rows down by 2 as n=2, just instead of a hardcoded value, n has to be the value present in the column 'Shift' for current row under process.

推荐答案

这项工作吗?

请注意,我优先考虑第5行中的值,因为第6行中的值将移至相同位置.

Note that I give priority to the value in row 5, as the value in row 6 would be shifted to the same position.

shift2 <- function(val,shift,fill){
  new_val <- rep(fill,length(val))
  indices <- (1:length(val))+shift
  indices <- indices[indices <= nrow(dat)]
  new_val[rev(indices)]= val[length(indices):1]
  new_val
}

dat[,`:=` (new.value= shift2(Value,Shift,fill = 0),
           new.I.value=shift2(I.Value,Shift,fill = 0))]

# > dat
#    Shift      Value    I.Value  new.value new.I.value
# 1:     0 -0.6264538  0.7383247 -0.6264538   0.7383247
# 2:     0  0.1836433  0.5757814  0.1836433   0.5757814
# 3:     0 -0.8356286 -0.3053884 -0.8356286  -0.3053884
# 4:     1  1.5952808  1.5117812  0.0000000   0.0000000
# 5:     2  0.3295078  0.3898432  1.5952808   1.5117812
# 6:     1 -0.8204684 -0.6212406  0.0000000   0.0000000
# 7:     1  0.4874291 -2.2146999  0.3295078   0.3898432

如果您想将第6行优先于第5行:

If you want to prioritize row 6 over row 5 :

shift3 <- function(val,shift,fill){
  new_val <- rep(fill,length(val))
  indices <- (1:length(val))+shift
  indices <- indices[indices <= nrow(dat)]
  new_val[indices]= val[1:length(indices)]
  new_val
}

dat[,`:=` (new.value= shift3(Value,Shift,fill = 0),
          new.I.value=shift3(I.Value,Shift,fill = 0))]
# 
# Shift      Value    I.Value  new.value new.I.value
# 1:     0 -0.6264538  0.7383247 -0.6264538   0.7383247
# 2:     0  0.1836433  0.5757814  0.1836433   0.5757814
# 3:     0 -0.8356286 -0.3053884 -0.8356286  -0.3053884
# 4:     1  1.5952808  1.5117812  0.0000000   0.0000000
# 5:     2  0.3295078  0.3898432  1.5952808   1.5117812
# 6:     1 -0.8204684 -0.6212406  0.0000000   0.0000000
# 7:     1  0.4874291 -2.2146999 -0.8204684  -0.6212406

这篇关于R脚本:在data.table中使用shift函数时-错误:(列表)对象不能被强制键入"double"的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆