R:从成员资格更改列表中重新创建历史成员资格 [英] R: Recreate historical membership from a list of changes in membership

查看:49
本文介绍了R:从成员资格更改列表中重新创建历史成员资格的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

假设我有一个组的当前成员身份,即成员的姓名.此外,我有一些新成员可能已添加到群组和/或旧成员可能已从群组中删除的时间的数据.

Suppose I have the current membership status of a group, i.e. names of members. Additionally, I have data on times when some new member may have been added to the group and / or an old member may have been removed from the group.

手头的任务是在所有这些时间点重新创建组的成员资格.我环顾四周,但没有找到解决这个问题的现成解决方案.有人知道这样做的优雅方法吗?

The task at hand is to recreate the membership of the group at all these points in time. I've looked around but did not find a ready solution for this problem. Does anybody know an elegant method of doing this?

输入:

periods <- 5
indx <- paste0("t-", seq_len(periods))
[1] "t-1" "t-2" "t-3" "t-4" "t-5"

current <- letters[seq_len(10)]
[1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "j"

incoming <- setNames(letters[seq_len(periods) + 5], indx)
incoming[2] <- NA
t-1 t-2 t-3 t-4 t-5
"f"  NA "h" "i" "j"

outgoing <- setNames(letters[seq_len(periods) + 10], indx)
outgoing[4] <- NA
t-1 t-2 t-3 t-4 t-5
"k" "l" "m"  NA "o"

输出:

$current
 [1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "j"

$`t-1`
 [1] "a" "b" "c" "d" "e" "g" "h" "i" "j" "k"

$`t-2`
 [1] "a" "b" "c" "d" "e" "g" "h" "i" "j" "k" "l"

$`t-3`
 [1] "a" "b" "c" "d" "e" "g" "i" "j" "k" "l" "m"

$`t-4`
 [1] "a" "b" "c" "d" "e" "g" "j" "k" "l" "m"

$`t-5`
 [1] "a" "b" "c" "d" "e" "g" "k" "l" "m" "o"

<小时>

免责声明:我已经为此编写了一个解决方案,我将发布该解决方案作为我对该问题的回答.目的是记录这个问题和一个可能的解决方案,并引出其他巧妙的和/或现有的解决方案或改进.


Disclaimer: I've written a solution for this which I will be posting as my answer to the question. The intent is to document this problem and a possible solution and to elicit other ingenious and / or existing solutions or improvements.

推荐答案

create_mem_ts 函数(成员时间序列)将生成问题中发布的所需输出.

The function create_mem_ts (membership timeseries) will generate the desired output as posted in the question.

create_mem_ts <- function (ctime, added, removed, current) {

  # Create a time-series of membership of a set.

  # Inputs:

  ## ctime:     Time of changes in set.
  ##            An atomic vector of a time-series class or otherwise,
  ##
  ##            interpretable as a time-series in descending order (for e.g.
  ##            `t-1`, `t-2`, `t-3` etc.
  ##
  ##            Is an index of when the changes in membership happened in time.
  ##            Allows repeats but no NAs.

  ## added:     Member(s) added to the set.
  ##            An atomic vector or a list of the same length as ctime.
  ##
  ##            If an atomic vector, represents exactly one member added at
  ##            the corresponding ctime.
  ##
  ##            If a list, represents multiple members added at corresponding
  ##            ctime.

  ## removed:   Member(s) removed from the set.
  ##            An atomic vector or a list of the same length as ctime.
  ##
  ##            If an atomic vector, represents exactly one member removed at
  ##            the corresponding ctime.
  ##
  ##            If a list, represents multiple members removed at the
  ##            corresponding ctime.

  ## current:   Current membership of the set.
  ##            An atomic vector listing the current membership of the set.

  # Output:

  ## A list of the same length as ctime named by values in ctime (coerced to
  ## character by the appropriate method). 

  stopifnot(is.atomic(ctime),
            is.atomic(added) || is.list(added),
            is.atomic(removed) || is.list(removed))

  if (any(is.na(ctime))) stop("NAs not allowed in the ctime.")

  stopifnot(length(ctime) == length(added),
            length(added) == length(removed))

  if (any(duplicated(ctime))) {
    ctime.u <- unique(ctime)
    ctime.f <- factor(ctime, levels=as.character(ctime.u))
    added <- split(added, ctime.f)
    removed <- split(removed, ctime.f)
  } else {
    ctime.u <- ctime
  }

  out <- setNames(vector(mode="list", length=length(ctime.u) + 1),
                  c("current", as.character(ctime.u)))
  out[["current"]] <- current

  for (i in 2:length(out))
    out[[i]] <- union(setdiff(out[[i - 1]], added[[i - 1]]),
                      na.omit(removed[[i - 1]]))

  attr(out, "index") <- ctime.u

  out

}

此外,如果 ctime 是上述函数中的有效时间序列类,则该类的输出可用于使用该函数(在 ctime 范围内)生成任何时间戳的成员资格) 使用这个函数memship_at.

Moreover, if ctime is a valid time-series class in the function above, the output from that can be used to generate membership on any time-stamp using the function (within the range in ctime) using this function memship_at.

memship_at <- function (mem_ts, at) {

  stopifnot(inherits(at, class(attr(mem_ts, "index"))))

  just.before <- which(at > attr(mem_ts, "index"))[1]

  if (just.before > 1)
    mem_ts[[just.before - 1]]
  else
    mem_ts[[1]]

}

这篇关于R:从成员资格更改列表中重新创建历史成员资格的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆