Using OpenMP critical and ordered

Problem description

I'm quite new to Fortran and OpenMP, but I'm trying to get my bearings. I have a piece of code for calculating variograms which I'm attempting to parallelize. However, I seem to be getting race conditions, as some of the results are off by a thousandth or so.

The problem seems to be the reductions. OpenMP reductions work and give the correct results, but they are not desirable, because the reductions actually happen in another subroutine (I copied the relevant lines into the OpenMP loop for the test). Therefore I put the reductions inside a CRITICAL section, but without success. Interestingly, the problem only occurs for reals, not integers. I have thought about whether or not the order of the additions makes any difference, but it should not produce errors this big.
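
For reference, the reduction variant corresponds to the commented-out reduction(+:np, tm, hm, gam) clause in the code below. Here is a minimal sketch of that pattern with hypothetical names (Fortran has long allowed arrays in REDUCTION clauses; allocatable arrays need OpenMP 3.0 or later):

PROGRAM reduction_sketch
  IMPLICIT NONE
  INTEGER :: i
  REAL :: acc(8)   ! stand-in for accumulators like np/tm/hm/gam
  acc = 0.0
!$OMP PARALLEL DO REDUCTION(+:acc)
  DO i = 1, 100000
    ! each thread updates a private copy of acc; OpenMP sums the
    ! private copies into the shared acc when the loop ends
    acc(MOD(i, 8) + 1) = acc(MOD(i, 8) + 1) + 1.0
  END DO
!$OMP END PARALLEL DO
  PRINT *, acc     ! every bin should hold 12500.0
END PROGRAM reduction_sketch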

Just to check, I put everything in the parallel do in an ORDERED block, which (of course) gave the correct results (albeit without any speedup). I also tried putting everything inside a CRITICAL section, but for some reason that did not give the correct results. My understanding is that OpenMP will flush the shared variables upon entering/exiting CRITICAL sections, so there shouldn't be any cache problems.

So my question is: why doesn't a critical section work in this case?

My code is below. All shared variables except np, tm, hm, gam are read-only.

I tried to simulate the randomness induced by multiple threads by replacing the do loops with random integers in the same range (i.e. generate a pair i,j in the range of the loops; if they are "visited", generate new ones) and to my surprise the results matched. However, upon further inspection it was revealed that I had forgotten to seed the RNG, and the results were correct by coincidence. How embarrassing!

TL;DR: The discrepancies in the results were caused by the ordering of the floating-point additions. Using double precision instead helps.
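
A small self-contained demonstration of that point (hypothetical data, nothing from the variogram code itself): summing the same values in two different orders visibly changes a single-precision total, while a double-precision total stays stable to far more digits.

PROGRAM order_demo
  IMPLICIT NONE
  INTEGER :: i
  REAL, ALLOCATABLE :: x(:)
  REAL :: s_fwd, s_bwd
  DOUBLE PRECISION :: d_fwd, d_bwd

  ALLOCATE(x(100000))
  CALL RANDOM_NUMBER(x)

  s_fwd = 0.0
  s_bwd = 0.0
  d_fwd = 0.0D0
  d_bwd = 0.0D0
  DO i = 1, SIZE(x)          ! sum in ascending order
    s_fwd = s_fwd + x(i)
    d_fwd = d_fwd + DBLE(x(i))
  END DO
  DO i = SIZE(x), 1, -1      ! sum in descending order
    s_bwd = s_bwd + x(i)
    d_bwd = d_bwd + DBLE(x(i))
  END DO

  PRINT *, 'single: fwd - bwd =', s_fwd - s_bwd   ! typically nonzero
  PRINT *, 'double: fwd - bwd =', d_fwd - d_bwd   ! typically zero or tiny
END PROGRAM order_demo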

!$OMP PARALLEL DEFAULT(none) SHARED(nd, x, y, z, nzlag, nylag, nxlag, &
!$OMP& dzlag, dylag, dxlag, nvarg, ivhead, ivtail, ivtype, vr, tmin, tmax, np, tm, hm, gam) num_threads(512)
!$OMP DO PRIVATE(i,j,zdis,ydis,xdis,izl,iyl,ixl,indx,vrh,vrt,vrhpr,vrtpr,variogram_type) !reduction(+:np, tm, hm, gam)
  DO i=1,nd        
!$OMP CRITICAL (main)
! Second loop over the data:
    DO j=1,nd

! The lag:
      zdis = z(j) - z(i)
      IF(zdis >= 0.0) THEN
        izl =  INT( zdis/dzlag+0.5)
      ELSE
        izl = -INT(-zdis/dzlag+0.5)
      END IF
 ! ---- SNIP ----

! Loop over all variograms for this lag:

      DO cur_variogram=1,nvarg
        variogram_type = ivtype(cur_variogram)

! Get the head and tail values:

        indx = i+(ivhead(cur_variogram)-1)*maxdim
        vrh   = vr(indx)
        indx = j+(ivtail(cur_variogram)-1)*maxdim
        vrt   = vr(indx)
        IF(vrh < tmin.OR.vrh >= tmax.OR. vrt < tmin.OR.vrt >= tmax) CYCLE

        ! ----- PROBLEM AREA -------
        np(ixl,iyl,izl,1)  = np(ixl,iyl,izl,1) + 1.   ! <-- This never fails
        tm(ixl,iyl,izl,1)  = tm(ixl,iyl,izl,1) + vrt  
        hm(ixl,iyl,izl,1)  = hm(ixl,iyl,izl,1) + vrh
        gam(ixl,iyl,izl,1) = gam(ixl,iyl,izl,1) + ((vrh-vrt)*(vrh-vrt))
        ! ----- END OF PROBLEM AREA -----

        !CALL updtvarg(ixl,iyl,izl,cur_variogram,variogram_type,vrt,vrh,vrtpr,vrhpr)
      END DO
    END DO
    !$OMP END CRITICAL (main)
  END DO
!$OMP END DO
!$OMP END PARALLEL

Many thanks in advance!

Answer

If you are using 32-bit floating-point numbers and arithmetic, the difference between 84.26539 and 84.26538, that is, a difference of 1 in the least-significant digit, is entirely explicable by the non-determinism of parallel floating-point arithmetic. Bear in mind that a 32-bit f-p number only has about 7 decimal digits to play with.
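
You can query this limit directly with the standard Fortran intrinsics (a quick check, independent of the question's code):

PROGRAM digits_check
  IMPLICIT NONE
  ! PRECISION gives the number of reliable decimal digits,
  ! EPSILON the spacing of representable numbers near 1.0
  PRINT *, 'single:', PRECISION(1.0),   EPSILON(1.0)     ! 6,  ~1.19E-07
  PRINT *, 'double:', PRECISION(1.0D0), EPSILON(1.0D0)   ! 15, ~2.22E-16
END PROGRAM digits_check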

Ordinary floating-point arithmetic is not strictly associative. For real (in the mathematical, not Fortran, sense) numbers, (a+b)+c == a+(b+c), but there is no such rule for floating-point numbers. This is nicely explained in the Wikipedia article on floating-point arithmetic.
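
A concrete single-precision example, with values chosen to make the effect obvious:

PROGRAM assoc_demo
  IMPLICIT NONE
  REAL :: a, b, c
  a =  1.0E8
  b = -1.0E8
  c =  1.0
  ! -1.0E8 + 1.0 rounds back to -1.0E8 in single precision,
  ! so the two bracketings of the same sum disagree
  PRINT *, '(a+b)+c =', (a + b) + c   ! prints 1.0
  PRINT *, 'a+(b+c) =', a + (b + c)   ! prints 0.0
END PROGRAM assoc_demo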

The non-determinism arises because, in using OpenMP, you surrender control over the ordering of operations to the run-time. A summation of values across threads (such as a reduction on +) leaves the bracketing of the global sum expression to the run-time. It is not even necessarily true that 2 executions of the same OpenMP program will produce the same-to-the-last-bit results.
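
One way to observe this (a sketch, assuming several threads are available): run the program below a few times with OMP_NUM_THREADS set to, say, 4, and compare the last digits of the printed total.

PROGRAM rerun_demo
  IMPLICIT NONE
  INTEGER :: i, n
  INTEGER, ALLOCATABLE :: seed(:)
  REAL, ALLOCATABLE :: x(:)
  REAL :: total

  ! fix the seed so every run sums exactly the same data
  CALL RANDOM_SEED(SIZE=n)
  ALLOCATE(seed(n))
  seed = 12345
  CALL RANDOM_SEED(PUT=seed)

  ALLOCATE(x(1000000))
  CALL RANDOM_NUMBER(x)

  total = 0.0
!$OMP PARALLEL DO REDUCTION(+:total)
  DO i = 1, SIZE(x)
    total = total + x(i)
  END DO
!$OMP END PARALLEL DO

  ! identical data, but the per-thread partial sums are combined
  ! in an unspecified order, so the last digits may differ between runs
  PRINT '(A,F20.8)', 'total = ', total
END PROGRAM rerun_demo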

I suspect that even running an OpenMP program on one thread may produce different results from the equivalent non-OpenMP program. Since knowledge of the number of threads available to an OpenMP executable may be deferred until run-time, the compiler will have to create a parallelised executable whether it is eventually run in parallel or not.
