R 中嵌套 ifelse 语句的替代方案 [英] Alternatives to nested ifelse statements in R
问题描述
假设我们有以下数据.行代表一个国家,而列 (in05:in09
) 表示该国家/地区是否在给定年份 (2005:2009
) 中存在于感兴趣的数据库中.>
Suppose we have the following data. The rows represent a country and the columns (in05:in09
) indicate whether that country was present in a database of interest in the given year (2005:2009
).
id <- c("a", "b", "c", "d")
in05 <- c(1, 0, 0, 1)
in06 <- c(0, 0, 0, 1)
in07 <- c(1, 1, 0, 1)
in08 <- c(0, 1, 1, 1)
in09 <- c(0, 0, 0, 1)
df <- data.frame(id, in05, in06, in07, in08, in09)
我想创建一个变量 firstyear
来指示该国家/地区出现在数据库中的第一年.现在我执行以下操作:
I want to create a variable firstyear
which indicates the first year in which the country was present in the database. Right now I do the following:
df$firstyear <- ifelse(df$in05==1,2005,
ifelse(df$in06==1,2006,
ifelse(df$in07==1, 2007,
ifelse(df$in08==1, 2008,
ifelse(df$in09==1, 2009,
0)))))
上面的代码已经不是很好了,我的数据集包含了很多年.是否有替代方法,使用 *apply
函数、循环或其他方法来创建这个 firstyear
变量?
The above code is already not very nice, and my dataset contains many more years. Is there an alternative, using *apply
functions, loops or something else, to create this firstyear
variable?
推荐答案
您可以使用 max.col
indx <- names(df)[max.col(df[-1], ties.method = "first") + 1L]
df$firstyear <- as.numeric(sub("in", "20", indx))
df
# id in05 in06 in07 in08 in09 firstyear
# 1 a 1 0 1 0 0 2005
# 2 b 0 0 1 1 0 2007
# 3 c 0 0 0 1 0 2008
# 4 d 1 1 1 1 1 2005
这篇关于R 中嵌套 ifelse 语句的替代方案的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!