给定开始日期和结束日期,请调整/扩展每天之间(连续一天)的数据 [英] Given start date and end date, reshape/expand data for each day between (each day on a row)
本文介绍了给定开始日期和结束日期,请调整/扩展每天之间(连续一天)的数据的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
我花了很多时间来获得R中的每个不同天数:
I have spent much time to get each difference days in R:
start <- as.Date(c("2013-02-26", "2013-03-26","2013-04-01","2013-04-26","2013-05-26"))
end <- as.Date(c("2013-03-25","2013-03-31","2013-04-25","2013-05-25","2013-06-25"))
per_cost <- c(3451380,3767052,3726900,4076868,3575311)
x <- data.frame(START_DAY=start, END_DAY=end, PER_COST=per_cost)
x$DIF_DAYS<- x$END_DAY-x$START_DAY
然后,我得到了:
START_DAY END_DAY PER_COST DIF_DAYS
1 2013-02-26 2013-03-25 3451380 27 days
2 2013-03-26 2013-03-31 3767052 5 days
3 2013-04-01 2013-04-25 3726900 24 days
4 2013-04-26 2013-05-25 4076868 29 days
5 2013-05-26 2013-06-25 3575311 30 days
我想获得以下输出:
DATE PER_COST
2013-02-26 3451380
2013-02-27 3451380
2013-02-28 3451380
2013-02-29 3451380
...
2013-03-25 3451380
2013-03-26 3767052
2013-03-27 3767052
2013-03-28 3767052
该怎么做?
推荐答案
使用 data.table
library(data.table)
setDT(x)[, list(DATE=seq(START_DAY, END_DAY, by = 'day')), PER_COST]
# PER_COST DATE
# 1: 3451380 2013-02-26
# 2: 3451380 2013-02-27
# 3: 3451380 2013-02-28
# 4: 3451380 2013-03-01
# 5: 3451380 2013-03-02
#---
#116: 3575311 2013-06-21
#117: 3575311 2013-06-22
#118: 3575311 2013-06-23
#119: 3575311 2013-06-24
#120: 3575311 2013-06-25
如果重复的 PER_COST
,则最好使用 1:nrow(x )
作为分组变量
If there are duplicate PER_COST
, then it may be better to use 1:nrow(x)
as the grouping variable
setDT(x)[, list(DATE=seq(START_DAY, END_DAY, by = 'day'),
PER_COST=rep(PER_COST, END_DAY-START_DAY+1)), 1:nrow(x)]
更新
使用 dplyr
library(dplyr)
x %>%
rowwise() %>%
do(data.frame(DATE=seq(.$START_DAY, .$END_DAY, by='day'),
PER_COST= rep(.$PER_COST, .$END_DAY-.$START_DAY+1)))
这篇关于给定开始日期和结束日期,请调整/扩展每天之间(连续一天)的数据的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文