Why are size_t and unsigned int slower than int?

Views: 129 Published: 2020/7/7 4:12:36 Tags: c++ performance int size-t

Problem description

I was experimenting with different integer types in a Visual Studio project on Windows, using the simple exchange sort algorithm below. The processor is Intel. The code was compiled in Release x64. The optimization setting is "Maximize Speed (/O2)". The command line corresponding to the compilation settings is

/permissive- /GS /GL /W3 /Gy /Zc:wchar_t /Zi /Gm- /O2 /sdl /Fd"x64\Release\vc141.pdb" /Zc:inline /fp:precise /D "NDEBUG" /D "_CONSOLE" /D "_UNICODE" /D "UNICODE" /errorReport:prompt /WX- /Zc:forScope /Gd /Oi /MD /Fa"x64\Release\" /EHsc /nologo /Fo"x64\Release\" /Fp"x64\Release\SpeedTestForIntegerTypes.pch" /diagnostics:classic

The code itself:

#include <ctime>
#include <vector>
#include <iostream>

void sort(int N, int A[], int WorkArray[]) // exchange sort
{
    int i, j, index, val_min;
    for (j = 0; j < N; j++)
    {
        val_min = 500000;
        for (i = j; i < N; i++)
        {
            if (A[i] < val_min)
            {
                val_min = A[i];
                index = i;
            }
        }
        WorkArray[j] = A[j];
        A[j] = val_min;
        A[index] = WorkArray[j];
    }
}

int main()
{
    std::vector<int> A(400000), WorkArray(400000);
    for(size_t k = 0; k < 400000; k++)
        A[k] = 400000 - (k+1);

    clock_t begin = clock();

    sort(400000, &A[0], &WorkArray[0]);

    clock_t end = clock();
    double sortTime = double(end - begin) / CLOCKS_PER_SEC;
    std::cout << "Sort time: " << sortTime << std::endl;
    return 0;
}

The WorkArray is only needed to save the vector before sorting. The point is that this sort took 22.3 seconds to complete. The interesting part is that if I change the type from int to size_t for the arrays A and WorkArray (both in std::vector and in the argument list of the function sort), as well as for val_min, the time increases to 67.4 seconds. That is three times slower! The new code is below:

#include <ctime>
#include <vector>
#include <iostream>

void sort(int N, size_t A[], size_t WorkArray[]) // exchange sort
{
    int i, j, index;
    size_t val_min;
    for (j = 0; j < N; j++)
    {
        val_min = 500000U;
        for (i = j; i < N; i++)
        {
            if (A[i] < val_min)
            {
                val_min = A[i];
                index = i;
            }
        }
        WorkArray[j] = A[j];
        A[j] = val_min;
        A[index] = WorkArray[j];
    }
}

int main()
{
    std::vector<size_t> A(400000), WorkArray(400000);
    for(size_t k = 0; k < 400000; k++)
        A[k] = 400000 - (k+1);

    clock_t begin = clock();

    sort(400000, &A[0], &WorkArray[0]);

    clock_t end = clock();
    double sortTime = double(end - begin) / CLOCKS_PER_SEC;
    std::cout << "Sort time: " << sortTime << std::endl;
    return 0;
}

Note that I still keep type int for the function's local variables i, j, index and for the parameter N, so the only two arithmetic operations, i++ and j++, should take the same amount of time in both cases. Therefore, this slowdown must have some other cause. Is it related to memory alignment, register sizes, or something else?

But the most outrageous part was when I changed int to unsigned int. Both unsigned int and int occupy the same number of bytes which is 4 (sizeof showed that). But the runtime for unsigned int was 65.8 s! While the first outcome was somewhat ok to accept, the second one totally confuses me! Why is there such a significant difference in time it takes to run such a simple algorithm that does not even involve sign checks?

Thanks to everyone who addresses both of these questions. Where can I start reading more about these hardware-level optimization peculiarities? I don't care about the sorting algorithm itself; it is here only to illustrate the problem.

UPDATE: once again, I stress the fact that I use ints for array indices in all three cases.
