使用OpenMP进行归纳:获取OpenMP中并行的for循环的范围值 [英] Induction with OpenMP: getting range values for a parallized for loop in OpenMP

查看：415 发布时间：2020/5/21 1:32:56 openmp

本文介绍了使用OpenMP进行归纳:获取OpenMP中并行的for循环的范围值的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我想知道一种在带有C ++的OpenMP中的for循环中获取给定线程的值范围的方法.例如，在下面的代码中，我想知道每个线程在每个线程的循环中使用的第一个值.

I would like to know a way to get the range of values for a given thread in a parallized for loop in OpenMP with C++. For example in the following code I would like to know what the first value each thread uses in the loop for each thread.

#pragma omp parallel for schedule(static)
for(int i=0; i<n; i++)

让我举个例子说明为什么我可能需要这些值.假设我想用计数值的总和填充一个数组.计数总数之和的封闭式解决方案是n*(n+1)/2.为此，我可以执行以下操作:

Let me give you an example of why I might want these values. Let's assume I want to fill an array with the sum of the counting numbers. The closed form solution for the sum of the counting number is n*(n+1)/2. To do this with OpenMP I could do this:

#pragma omp parallel for schedule(static)
for(int i=0; i<n; i++) {    
    a[i] = i*(i+1)/2;
}

但是，我怀疑一种获取计数总和的更快方法是，不要在每次迭代(都有一个平方)时都不使用封闭形式的解，而应该记住每次迭代的总和是这样的:

However, I suspect a faster method to get the sum of the counting numbers is to not use the closed form solution each iteration (which has a square) and instead remember the sum each iteration like this:

int cnt = 0;
for(int i=0; i<n; i++) {
    cnt += i;
    a[i] = cnt;
}

但是我能想到的使用OpenMP做到这一点的唯一方法就是像这样明确定义范围值:

But the only way to do this with OpenMP I can think of is explictly define the range values like this:

#pragma omp parallel
{
    const int ithread = omp_get_thread_num();
    const int nthreads = omp_get_num_threads();
    const int start = ithread*n/nthreads;
    const int finish = (ithread+1)*n/nthreads;

    int cnt = 0;
    int offset = (start-1)*(start)/2;
    for(int i=start; i<finish; i++) {
        cnt += i;
        a[i] = cnt + offset;
    }
}

如果可以从#pragma omp parallel for schedule(static)获取起始值，则不必定义start, finish, ithread, and nthreads.

If I could get the start value from #pragma omp parallel for schedule(static) then I would not have to define start, finish, ithread, and nthreads.

阅读 Agner Fog的Optimizing C ++ 手册后，我意识到我在做的事情叫做归纳法. 他举了一个使用归纳法更有效地计算多项式值的示例.这是他的手册中的一些例子

After reading Agner Fog's Optimizing C++ manual I realized that what I am doing is called induction. He gives an example of using induction to more efficiently calculate the values of a polynominal. Here are some examples from his manual

没有归纳法:

// Example 8.23a. Loop to make table of polynomial
const double A = 1.1, B = 2.2, C = 3.3; // Polynomial coefficients
double Table[100]; // Table
int x; // Loop counter
for (x = 0; x < 100; x++) {
    Table[x] = A*x*x + B*x + C; // Calculate polynomial

带有感应:

// Example 8.23b. Calculate polynomial with induction variables
const double A = 1.1, B = 2.2, C = 3.3; // Polynomial coefficients
double Table[100]; // Table
int x; // Loop counter
const double A2 = A + A; // = 2*A
double Y = C; // = A*x*x + B*x + C
double Z = A + B; // = Delta Y
for (x = 0; x < 100; x++) {
    Table[x] = Y; // Store result
    Y += Z; // Update induction variable Y
    Z += A2; // Update induction variable Z
}

要使用OpenMP做到这一点，我需要获取每个块的起始值.使用OpenMP执行此操作的唯一方法是手动定义块.

To do this with OpenMP I need to get the start value for each chunk. The only way to do this with OpenMP is to define the chunks manually.

使用OpenMP进行归纳:获取OpenMP中并行的for循环的范围值 [英] Induction with OpenMP: getting range values for a parallized for loop in OpenMP

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

使用OpenMP进行归纳:获取OpenMP中并行的for循环的范围值 [英] Induction with OpenMP: getting range values for a parallized for loop in OpenMP

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭