How to speed up Eigen library's matrix product?


Problem description

I'm studying simple multiplication of two big matrices using the Eigen library. This multiplication appears to be noticeably slower than both Matlab and Python for the same size matrices.

Is there anything to be done to make the Eigen operation faster?

Problem details

X : random 1000 x 50000 matrix

Y : random 50000 x 300 matrix

Timing experiment (run on my late-2011 MacBook Pro)

Using Matlab: X*Y takes ~1.3 sec

Using Enthought Python: numpy.dot(X, Y) takes ~2.2 sec

Using Eigen: X*Y takes ~2.7 sec

Eigen details

You can get my Eigen code (as a MEX function): https://gist.github.com/michaelchughes/4742878

This MEX function reads in two matrices from Matlab, and returns their product.

Running this MEX function without the matrix product operation (i.e., just doing the IO) produces negligible overhead, so the IO between the function and Matlab doesn't explain the big difference in performance. It's clearly the actual matrix product operation.
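
As a rough illustration of what such a MEX gateway can look like, here is a minimal sketch assuming Eigen 3.1 and the standard MEX C++ API; the file name and include path are hypothetical, and this is not necessarily the code in the gist above:

// Hypothetical eigenMatProd.cpp: multiply two double matrices passed from Matlab using Eigen.
// Built from the Matlab prompt with something like:
//   mex CXXOPTIMFLAGS="-O3 -DNDEBUG" -I/path/to/eigen eigenMatProd.cpp
#include "mex.h"
#include <Eigen/Dense>

void mexFunction(int nlhs, mxArray *plhs[], int nrhs, const mxArray *prhs[])
{
    // Matlab stores matrices column-major, which matches Eigen's default layout,
    // so the input buffers can be mapped without copying.
    const mwSize m = mxGetM(prhs[0]);   // rows of X
    const mwSize k = mxGetN(prhs[0]);   // cols of X == rows of Y
    const mwSize n = mxGetN(prhs[1]);   // cols of Y

    Eigen::Map<const Eigen::MatrixXd> X(mxGetPr(prhs[0]), m, k);
    Eigen::Map<const Eigen::MatrixXd> Y(mxGetPr(prhs[1]), k, n);

    plhs[0] = mxCreateDoubleMatrix(m, n, mxREAL);
    Eigen::Map<Eigen::MatrixXd> Z(mxGetPr(plhs[0]), m, n);

    Z.noalias() = X * Y;   // the operation being timed
}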

I'm compiling with g++, with these optimization flags: "-O3 -DNDEBUG"

I'm using the latest stable Eigen header files (3.1.2).

Any suggestions on how to improve Eigen's performance? Can anybody replicate the gap I'm seeing?

UPDATE: The compiler really seems to matter. The original Eigen timing was done using Apple Xcode's version of g++: llvm-g++-4.2.

When I use g++-4.7 downloaded via MacPorts (same CXXOPTIMFLAGS), I get 2.4 sec instead of 2.7.

Any other suggestions on how to compile better would be much appreciated.

You can also get raw C++ code for this experiment: https://gist.github.com/michaelchughes/4747789

./MatProdEigen 1000 50000 300

reports 2.4 seconds under g++-4.7

Recommended answer

First of all, when doing this kind of performance comparison, make sure you disable turbo boost (TB). On my system, using gcc 4.5 from MacPorts and without turbo boost, I get 3.5 s; that corresponds to 8.4 GFLOPS, while the theoretical peak of my 2.3 GHz Core i7 is 9.2 GFLOPS, so not too bad.
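
(For reference, this product takes roughly 2 x 1000 x 50000 x 300 ≈ 3 x 10^10 floating-point operations, which is where the GFLOPS figure comes from when divided by the measured time, and the 9.2 GFLOPS peak presumably corresponds to 2.3 GHz x 4 double-precision flops per cycle with SSE, since Eigen 3.1 does not use AVX.)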

Matlab is based on Intel MKL, and judging from the reported performance, it is clearly using a multithreaded version. It is unlikely that a small library like Eigen can beat Intel on its own CPUs!

Numpy can use any BLAS library: Atlas, MKL, OpenBLAS, eigen-blas, etc. I guess that in your case it was using Atlas, which is fast too.

Finally, here is how you can get better performance: enable multithreading in Eigen by compiling with -fopenmp. By default, Eigen uses the number of threads defined by OpenMP. Unfortunately, that number corresponds to the number of logical cores rather than physical cores, so make sure hyper-threading is disabled, or set the OMP_NUM_THREADS environment variable to the number of physical cores. With that, I get 1.25 s without TB and 0.95 s with TB.
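
As a concrete sketch of that suggestion, a standalone version of the benchmark might look like the following; the thread count of 4 and the compile line in the comment are assumptions, and Eigen::setNbThreads is simply an in-code alternative to setting OMP_NUM_THREADS externally (assuming Eigen 3.1 or newer):

// Hypothetical standalone benchmark using Eigen's OpenMP-parallel matrix product.
// Compile with something like: g++ -O3 -DNDEBUG -fopenmp -I/path/to/eigen matprod.cpp -o matprod
#include <Eigen/Dense>
#include <iostream>

int main()
{
    // Restrict Eigen's thread pool to the number of physical cores (4 here is an assumption);
    // alternatively, export OMP_NUM_THREADS=4 before running.
    Eigen::setNbThreads(4);

    Eigen::MatrixXd X = Eigen::MatrixXd::Random(1000, 50000);
    Eigen::MatrixXd Y = Eigen::MatrixXd::Random(50000, 300);

    Eigen::MatrixXd Z = X * Y;          // multithreaded GEMM when built with -fopenmp
    std::cout << Z(0, 0) << std::endl;  // keep the result live so the product is not optimized away
    return 0;
}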
