Why is this naive matrix multiplication faster than base R's?
Question
In R, matrix multiplication is very optimized, i.e. is really just a call to BLAS/LAPACK. However, I'm surprised this very naive C++ code for matrix-vector multiplication seems reliably 30% faster.
library(Rcpp)

# Simple C++ code for matrix-vector multiplication
mm_code =
  "NumericVector my_mm(NumericMatrix m, NumericVector v){
    int nRow = m.rows();
    int nCol = m.cols();
    NumericVector ans(nRow);
    double v_j;
    for(int j = 0; j < nCol; j++){
      v_j = v[j];
      for(int i = 0; i < nRow; i++){
        ans[i] += m(i,j) * v_j;
      }
    }
    return(ans);
  }"
# Compiling
my_mm = cppFunction(code = mm_code)

# Simulating data to use
nRow = 10^4
nCol = 10^4
m = matrix(rnorm(nRow * nCol), nrow = nRow)
v = rnorm(nCol)

system.time(my_ans <- my_mm(m, v))
#>    user  system elapsed
#>   0.103   0.001   0.103
system.time(r_ans <- m %*% v)
#>    user  system elapsed
#>   0.154   0.001   0.154

# Double checking the answer is correct
max(abs(my_ans - r_ans))
#> [1] 0
Does base R's %*% perform some type of data check that I'm skipping over?
After understanding what's going on (thanks SO!), it's worth noting that this is a worst-case scenario for R's %*%, i.e. matrix by vector. For example, @RalfStubner pointed out that an RcppArmadillo implementation of matrix-vector multiplication is even faster than the naive implementation I demonstrated, implying it is considerably faster than base R, yet it is virtually identical to base R's %*% for matrix-matrix multiplication (when both matrices are large and square):
arma_code <-
  "arma::mat arma_mm(const arma::mat& m, const arma::mat& m2) {
    return m * m2;
  }"
arma_mm = cppFunction(code = arma_code, depends = "RcppArmadillo")
nRow = 10^3
nCol = 10^3
mat1 = matrix(rnorm(nRow * nCol), nrow = nRow)
mat2 = matrix(rnorm(nRow * nCol), nrow = nRow)
system.time(arma_mm(mat1, mat2))
#>    user  system elapsed
#>   0.798   0.008   0.814
system.time(mat1 %*% mat2)
#>    user  system elapsed
#>   0.807   0.005   0.822
So R's current (v3.5.0) %*% is near optimal for matrix-matrix multiplication, but could be significantly sped up for matrix-vector multiplication if you're okay with skipping the checking.
Answer
A quick glance in names.c (here in particular) points you to do_matprod, the C function that is called by %*% and which is found in the file array.c. (Interestingly, it turns out that both crossprod and tcrossprod dispatch to that same function as well.) Here is a link to the code of do_matprod.
Scrolling through the function, you can see that it takes care of a number of things your naive implementation does not, including:
- Keeps row and column names, where that makes sense.
- Allows for dispatch to alternative S4 methods when the two objects being operated on by a call to %*% are of classes for which such methods have been provided. (That's what's happening in this portion of the function.)
- Handles both real and complex matrices.
- Implements a series of rules for how to handle multiplication of a matrix and a matrix, a vector and a matrix, a matrix and a vector, and a vector and a vector. (Recall that under cross-multiplication in R, a vector on the LHS is treated as a row vector, whereas on the RHS, it is treated as a column vector; this is the code that makes that so.)
Near the end of the function, it dispatches to either matprod or cmatprod. Interestingly (to me at least), in the case of real matrices, if either matrix might contain NaN or Inf values, then matprod dispatches (here) to a function called simple_matprod, which is about as simple and straightforward as your own. Otherwise, it dispatches to one of a couple of BLAS Fortran routines which, presumably, are faster if uniformly 'well-behaved' matrix elements can be guaranteed.