Estimating the efficiency of GPU in FLOPS (CUDA SAMPLES)


Question

It seems to me that I don't completely understand the concept of FLOPS. In the CUDA SAMPLES, there is a Matrix Multiplication Example (0_Simple/matrixMul). In this example, the number of FLOPs (floating-point operations) per matrix multiplication is calculated via the formula:

 double flopsPerMatrixMul = 2.0 * (double)dimsA.x * (double)dimsA.y * (double)dimsB.x;

So this means that, in order to multiply a matrix A (n x m) by B (m x k), we need to perform 2*n*m*k floating-point operations.
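
For context, here is a minimal sketch of how this count is typically turned into a GFLOP/s figure, in the style of the matrixMul sample; the variable msecPerMatrixMul (average kernel time in milliseconds, e.g. obtained via cudaEventElapsedTime) is assumed to be measured elsewhere:

 // Sketch: converting the FLOP count into GFLOP/s. msecPerMatrixMul is
 // assumed to hold the measured average kernel time in milliseconds
 // (e.g., from CUDA events).
 double flopsPerMatrixMul = 2.0 * (double)dimsA.x * (double)dimsA.y * (double)dimsB.x;
 double gigaFlops = (flopsPerMatrixMul * 1.0e-9) / (msecPerMatrixMul / 1000.0);
 printf("Performance = %.2f GFlop/s\n", gigaFlops);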

However, in order to calculate one element of the resulting matrix C (n x k), one has to perform m multiplications and (m-1) additions. So the total number of operations (to calculate all n x k elements) is m*n*k multiplications and (m-1)*n*k additions.

Of course, we could count m*n*k additions as well (initializing the accumulator to zero costs one extra addition per element), and the total number of operations would then be 2*n*m*k, half of them multiplications and half additions.
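
To make that count concrete, here is a hypothetical plain reference loop for C = A * B (host code, not the sample's tiled kernel); with the accumulator initialized to zero, each of the n*k output elements costs exactly m multiplies and m adds:

 // Hypothetical reference implementation. Starting sum at 0.0f yields
 // exactly m multiplies and m additions per element of C, i.e. 2*n*m*k
 // floating-point operations in total.
 for (int i = 0; i < n; ++i) {         // rows of A and C
     for (int j = 0; j < k; ++j) {     // columns of B and C
         float sum = 0.0f;             // zero init makes the add count m, not m-1
         for (int p = 0; p < m; ++p)
             sum += A[i * m + p] * B[p * k + j];  // one multiply + one add
         C[i * k + j] = sum;
     }
 }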

But I guess multiplication is more computationally expensive than addition. Why are these two types of operations lumped together? Is that always the case in computer science? How can one take two different types of operations into account?

Sorry for my English)

Answer

The short answer is that yes, they count both the multiplications and the additions. Even though most floating-point processors have a fused multiply/add operation, they still count the multiply and the add as two separate floating-point operations.
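
As an illustration (a sketch, not the sample's actual tiled kernel), the inner loop of a naive CUDA kernel typically compiles to one fused FFMA instruction per iteration, yet by convention each fmaf() still counts as two FLOPs:

 // Sketch of a naive kernel: each thread computes one element of C.
 // The fmaf() in the inner loop usually maps to a single FFMA
 // instruction, but is still counted as 2 FLOPs (1 mul + 1 add).
 __global__ void matmulNaive(const float *A, const float *B, float *C,
                             int n, int m, int k) {
     int row = blockIdx.y * blockDim.y + threadIdx.y;  // row of C
     int col = blockIdx.x * blockDim.x + threadIdx.x;  // column of C
     if (row < n && col < k) {
         float sum = 0.0f;
         for (int p = 0; p < m; ++p)
             sum = fmaf(A[row * m + p], B[p * k + col], sum);
         C[row * k + col] = sum;
     }
 }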

This is part of why people have been complaining for decades that FLOPs is basically a meaningless measurement. For it to mean even a little, you nearly need to specify some particular body of code for which you're measuring the FLOPs (e.g., "Linpack gigaflops"). Even then, you sometimes need fairly tight control over things like which compiler optimizations are allowed, to ensure that what you're measuring is really machine speed rather than the compiler's ability to simply eliminate some operations.

Ultimately, it's concerns like these that have led to organizations being formed to set up benchmarks, along with rules about how those benchmarks must be run and the results reported (e.g., SPEC). Otherwise, it can be difficult to be at all certain that the results you see reported for two different processors are really comparable in any meaningful way. Even with such rules, comparisons can be difficult; without them, they can border on meaningless.

