Why is vectorization faster

Views: 32. Published: 2021/9/17 19:16:30. Tags: r, vectorization

Question

I've been learning R for a while now, and have come across a lot of advice to programming types like myself to vectorize operations. Being a programmer, I'm interested as to why / how it's faster. An example:

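n <- 10^7
# populate with random numbers
v <- runif(n)
# vectorized: square every element in a single call
system.time({vv <- v * v; m <- mean(vv)}); m
# explicit loop: square one element at a time (vv already exists from the line above)
system.time({for (i in seq_along(v)) { vv[i] <- v[i] * v[i] }; m <- mean(vv)}); m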

This gives:

   user  system elapsed 
   0.04    0.01    0.07 
[1] 0.3332091

   user  system elapsed 
  36.68    0.02   36.69 
[1] 0.3332091
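
To confirm that the two forms really compute the same values, and to repeat the comparison at a size that finishes quickly, something like the following sketch could be used. The helper names `square_vectorized` and `square_loop` are hypothetical and not part of the code above; the timings depend on the machine and the R version, and on recent R versions (where loops are byte-compiled by default) the gap is likely to be smaller than the one shown above.

```r
# Hypothetical helpers for illustration only.
square_vectorized <- function(v) v * v

square_loop <- function(v) {
  out <- numeric(length(v))        # preallocate the result vector
  for (i in seq_along(v)) {
    out[i] <- v[i] * v[i]          # one interpreted iteration per element
  }
  out
}

set.seed(1)
v <- runif(1e5)                    # smaller than 10^7 so the loop finishes quickly

t_vec  <- system.time(a <- square_vectorized(v))
t_loop <- system.time(b <- square_loop(v))

print(t_vec)
print(t_loop)
identical(a, b)                    # both approaches perform the same double multiplications
```

Wrapping each variant in a function keeps the result vectors separate, so the loop version does not depend on a `vv` left over from the vectorized run.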

The most obvious thing to consider is that we're running native code, i.e. machine code compiled from C or C++, rather than interpreted code, as shown by the massive difference in user time between the two examples (circa 3 orders of magnitude). But is there anything else going on? For example, does R do:

Cunning native data structures, e.g. clever ways of storing sparse vectors or matrices so that we only do multiplications when we need to?

Lazy evaluation, e.g. on a matrix multiply, don't evaluate cells until as and when you need to.

Parallel processing.

Something else.

To test whether there might be some sparse vector optimization, I tried doing dot products with different vector contents:

# populate with random numbers
v <- runif(n)
system.time({m <- v %*% v / n}); m
# populate with runs of a single 1 followed by 99 0s
v <- rep(c(1, rep(0, 99)), n / 100)
system.time({m <- v %*% v / n}); m
# populate with 0s
v <- rep(0, n)
system.time({m <- v %*% v / n}); m

However there was no significant difference in time (circa 0.09 elapsed)
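
A complementary check, sketched here as an assumption about how one might probe the storage side of the same question, is to compare the memory footprint of a random vector and an all-zero vector of the same length. If plain numeric vectors were stored sparsely, the all-zero one would be expected to be much smaller. The variable names are illustrative only.

```r
n <- 10^6

dense_random <- runif(n)       # arbitrary non-zero contents
dense_zero   <- numeric(n)     # every element is 0

# object.size() estimates the memory used by an R object, in bytes
size_random <- as.numeric(object.size(dense_random))
size_zero   <- as.numeric(object.size(dense_zero))

print(c(random = size_random, zeros = size_zero))
size_random == size_zero       # same footprint regardless of contents
```

Both vectors occupy roughly 8 bytes per element, which is consistent with the timings above showing no dependence on the vector contents.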

(Similar question for Matlab: Why does vectorized code run faster than for loops in MATLAB?)
