分割工作，多个线程需要更多的时间，为什么呢？ [英] Dividing work to more threads takes more time, why?

查看：129 发布时间：2016/8/23 10:12:17 c multithreading performance pthreads multiprocessing

本文介绍了分割工作，多个线程需要更多的时间，为什么呢？的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个计算小的C程序的 PI 的使用 monte-卡罗 -simulation这基本上只是测试了一个随机点[X，Y]如果它是内部还是一个圈子之外。

I have a small C program which calculates pi using a monte-carlo-simulation which basically just tests for a random point [x,y] if it's inside or outside a circle.

要逼近的 PI 的我必须使用大量的样本的 N 的有直接正比复杂的 O（N）的。因此，试图到n计算样本数量巨大，我实现了 POSIX线程 API来parallize的计算能力

To approximate pi I have to use a high number of samples n which has a direct proportional complexity of O(n). So trying to calculate a huge number of samples n, I implemented POSIX threads api to parallize the computational power.

我的code是这样的：

My code looks like this:

pthread_t worker[nthreads]; /* creates workers for each thread */
struct param aparam[nthreads]; /* struct param{ long* hits; long rounds; }; */
long nrounds = nsamples / nthreads; /* divide samples to subsets of equal rounds per thread */

for (int i = 0; i < nthreads; ++i) { /* loop to create threads */
    aparam[i].hits = 0;
    aparam[i].rounds = nrounds;
    pthread_create(&worker[i], NULL, calc_pi, &aparam[i]); /* calls calc_pi(void* vparam){}  */ 
}

long nhits = 0;
for (int j = 0; j < nthreads; ++j) { /* collects results */
    pthread_join(worker[j], NULL);
    nhits += (long)aparam[j].hits; /* counts hits inside the cicrle */
}

这是每个线程正在做：

And this is what each thread is doing:

void* calc_pi(void* vparam)
{ /* counts hits inside a circle */
    struct param *iparam;
    iparam = (struct param *) vparam;
    long hits = 0;
    float x, y, z;
    for (long i = 0; i < iparam->rounds; ++i) {
        x = (float)rand()/RAND_MAX;
        y = (float)rand()/RAND_MAX;
        z = x * x + y * y;
        if (z <= 1.f) /* circle radius of 1 */
            ++hits;
    }
    iparam->hits = (long*)hits;
    return NULL;
}

现在我有一个奇怪的观察。与同组样品的 N 的和越来越多的线程的 I 的这个计划需要，而不是更少的更多的时间

Now I have a strange observation. With the same set of samples n and with an increasing number of threads i this program takes more time instead of less.

下面是一些平均运行时间（可重放）：

Here are some average run times (reproducable):

-------------------------------------------------
| Threads[1] | Samples[1] | Rounds[1] | Time[s] |
-------------------------------------------------
|        32  |  268435456 |   8388608 |    118  |
|        16  |  268435456 |  16777216 |    106  |
|         8  |  268435456 |  33554432 |    125  |
|         4  |  268435456 |  67108864 |    152  |
|         2  |  268435456 | 134217728 |     36  |
|         1  |  268435456 | 268435456 |     15  |
-------------------------------------------------

为什么两个线程做同样的工作比服用双倍的时间超过一个单线程更是实例？我的假设是两个线程划分工作应至少减少50％的时间。

Why is for instance two threads doing the same work taking more than double of the time than one single thread? My assumption is that two threads dividing the work should reduce the time by at least 50%.

用gcc编译4.9.1及以下标志：

Compiled with GCC 4.9.1 and the following flags:

gcc -O2 -std=gnu11 -pthread pipa.c -lpthread -o pipa

我的硬件是一个双Intel Xeon E5520（2个处理器，每个4核）@ 2.26 GHz的超线程禁用，运行Linux的科学与2.6.18内核。

My hardware is a Dual Intel Xeon E5520 (2 processors with each 4 cores) @ 2.26 GHz, hyperthreading disabled, running scientific linux with 2.6.18 kernel.

任何想法？

分割工作，多个线程需要更多的时间，为什么呢？ [英] Dividing work to more threads takes more time, why?

问题描述

推荐答案

相关文章

C/C++最新文章

热门教程

热门工具

登录关闭

分割工作，多个线程需要更多的时间，为什么呢？ [英] Dividing work to more threads takes more time, why?

问题描述

推荐答案

相关文章

C/C++最新文章

热门教程

热门工具

登录 关闭

登录关闭