OpenMP并行化(块矩阵多) [英] OpenMP parallelization (Block Matrix Mult)

查看：84 发布时间：2020/5/7 19:27:53 c matrix openmp matrix-multiplication

本文介绍了OpenMP并行化(块矩阵多)的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试实现块矩阵乘法并使它更加并行化.

I'm attempting to implement block matrix multiplication and making it more parallelized.

这是我的代码:

int i,j,jj,k,kk;
float sum;
int en = 4 * (2048/4);
    #pragma omp parallel for collapse(2) 
for(i=0;i<2048;i++) {
    for(j=0;j<2048;j++) {
        C[i][j]=0;
    }
}
for (kk=0;kk<en;kk+=4) {
    for(jj=0;jj<en;jj+=4) {
        for(i=0;i<2048;i++) {
            for(j=jj;j<jj+4;j++) {
                sum = C[i][j];
                for(k=kk;k<kk+4;k++) {
                    sum+=A[i][k]*B[k][j];
                }
                C[i][j] = sum;
            }
        }
    }
}

我一直在使用OpenMP，但是仍然无法确定在最短的时间内完成此操作的最佳方法.

I've been playing around with OpenMP but still have had no luck in figuring what the best way to have this done in the least amount of time.

OpenMP并行化(块矩阵多) [英] OpenMP parallelization (Block Matrix Mult)

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

OpenMP并行化(块矩阵多) [英] OpenMP parallelization (Block Matrix Mult)

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭