自动公式构建 [英] Automated formula construction

查看：106 发布时间：2020/5/10 19:16:22 r modeling

本文介绍了自动公式构建的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

在最近的一项作业中，我们被要求运行27个线性模型，每次添加一个附加变量(目标是绘制R ²与调整后的R ²).我发现很难通过算法创建这样的公式.我最终使用的代码如下所示(请注意，数据框中的第一列是因变量，其余所有都是预期的自变量.

In a recent homework assignment, we were instructed to run 27 linear models, each time adding an additional variable (the goal was to plot the changes in R² vs. changes in adjusted R²). I found it difficult to algorithmically create formulas like this. The code I ended up using looked like this following (note that the first column in the data frame is the dependent variable, all the rest are prospective independent variables.

 make.formula <- function(howfar) {
  formula <- c()
  for (i in 1:howfar) {
    if (i == 1) {
      formula <- paste(formula, names(d)[i], '~')}
    else if (i == howfar) {
      formula <- paste(formula, names(d)[i], '')
    }
    else {
      formula <- paste(formula, names(d)[i], '+')}
  }
  return(formula)
}

formulas <- lapply(seq(2, length(d)), make.formula)
formulas <- lapply(formulas, as.formula)
fits <- lapply(formulas, lm, data = d)

这可行，但似乎还不理想，我的印象是我在R中使用for循环所做的任何事情都可能不是最佳方法.有没有更简便的方法可以通过算法构造给定数据框的公式?

This works, but seems far from ideal, and my impression is that anything I'm doing with a for-loop in R is probably not being done the best way. Is there an easier way to algorithmically construct formulas for a given data frame?

推荐答案

reformulate()，一种从字符向量创建公式的漂亮函数，可能会派上用场.这是它的一个示例:

reformulate(), a nifty function for creating formulas from character vectors, might come in handy. Here's an example of what it does:

reformulate(response="Y", termlabels=c("X1", "X2", "X3"))
# Y ~ X1 + X2 + X3

这是您在实践中如何使用它的方法. (请注意，我在这里在lm()调用内部创建公式.由于formula对象随其附带了有关创建它们的环境的信息，因此我会很犹豫在外部创建它们 lm()调用中的>.):

And here's how you might use it in practice. (Note that I here create the formulas inside of the lm() calls. Because formula objects carry with them info about the environment they were created in, I'd be a bit hesitant to create them outside of the lm() call within which you actually want to use them.):

evars <- names(mtcars)[2:5] ii <- lapply(1:4, seq_len) lapply(ii, function(X) { coef(lm(reformulate(response="mpg", termlabels=evars[X]), data=mtcars)) }) # [[1]] # (Intercept) cyl # 37.88458 -2.87579 # # [[2]] # (Intercept) cyl disp # 34.66099474 -1.58727681 -0.02058363 # # [[3]] # (Intercept) cyl disp hp # 34.18491917 -1.22741994 -0.01883809 -0.01467933 # # [[4]] # (Intercept) cyl disp hp drat # 23.98524441 -0.81402201 -0.01389625 -0.02317068 2.15404553

这篇关于自动公式构建的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

自动公式构建 [英] Automated formula construction

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

自动公式构建 [英] Automated formula construction

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭