Minimize floating point error when adding multiple floating point variables

Views: 72 | Posted: 2020/11/8 21:15:42 | Tags: c++, floating-point, floating-accuracy


Problem description

In my C++ app I have a vector of doubles in the range (0,1), and I have to calculate its total as accurately as possible. It feels like this issue should have been addressed before, but I can't find anything.

Obviously, iterating through each item of the vector and doing sum += vect[i] accumulates a significant error if the vector is large and some items are significantly smaller than the others.
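
To make the effect concrete, here is a minimal sketch of the single-loop approach and an input that defeats it. The names naiveSum and makeSkewedInput and the chosen values are illustrative assumptions, not part of the original post. The sketch assumes IEEE 754 doubles evaluated in double precision (no extended-precision registers, no -ffast-math).

```cpp
#include <vector>

// Hypothetical helper: plain left-to-right accumulation, as in sum += vect[i].
double naiveSum(const std::vector<double>& v)
{
  double sum = 0.0;
  for (double x : v)
    sum += x;
  return sum;
}

// Hypothetical input: one comparatively large item followed by a million tiny ones.
// Each tiny item is below half the spacing of doubles near 0.5 (about 5.5e-17),
// so adding it to a running sum of 0.5 rounds straight back to 0.5.
std::vector<double> makeSkewedInput()
{
  std::vector<double> v(1000001, 1e-17);
  v[0] = 0.5;
  return v;
}
```

Under those assumptions, naiveSum(makeSkewedInput()) stays at 0.5, although the exact total is about 0.5 + 1e-11: every small term is lost.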

My current solution is this function:

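#include <algorithm>
#include <cstddef>
#include <vector>

double sumDoubles(std::vector<double> arg)  // pass by copy: the vector is used as scratch space
{
  if (arg.empty()) return 0.0;              // arg[0] would be undefined for an empty vector
  std::sort(arg.rbegin(), arg.rend());      // reverse iterators: arg ends up in descending order
  const std::size_t n = arg.size();
  for (std::size_t i = 1; i < n; i *= 2)            // i: distance between the two summands
    for (std::size_t j = 0; j + i < n; j += 2 * i)  // j + i < n avoids unsigned underflow of n - i
      arg[j] += arg[j + i];
  return arg[0];                            // root of the implicit binary tree
}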

Basically it sorts the input (in descending order as written, since the sort runs over reverse iterators) and calculates pairwise sums:

a+b+c+d+e+f+g+h = ((a+b)+(c+d)) + ((e+f)+(g+h))

It is like constructing a binary tree, but doing it in place. Sorting should ensure that at each step the two summands are of comparable magnitude.

The code above does perform better than a single loop with a running sum. However, I am curious whether it is possible to increase precision further without degrading performance too much.
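
One commonly used direction for this kind of problem is compensated summation. The sketch below uses the Neumaier variant of Kahan summation. It is a different technique from the pairwise function above, not something taken from the original post, and the name compensatedSum is a hypothetical choice. It makes a single pass with no sort and no copy, and it assumes the compiler does not reassociate floating-point arithmetic (for example, no -ffast-math).

```cpp
#include <cmath>
#include <vector>

// Hypothetical helper: Neumaier's compensated summation.
// comp collects the low-order bits that each addition to sum rounds away.
double compensatedSum(const std::vector<double>& v)
{
  double sum = 0.0;
  double comp = 0.0;  // running compensation for lost low-order bits
  for (double x : v) {
    const double t = sum + x;
    if (std::fabs(sum) >= std::fabs(x))
      comp += (sum - t) + x;  // low-order bits of x were lost
    else
      comp += (x - t) + sum;  // low-order bits of sum were lost
    sum = t;
  }
  return sum + comp;  // apply the correction once at the end
}
```

As a rough check, for a vector holding 0.5 followed by a million items of 1e-17, this returns a value within rounding of 0.5 + 1e-11, whereas a plain running sum returns 0.5. Whether it is more accurate or faster than the sorted pairwise version on real data would need to be measured.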
