编译器生成代价高昂的 MOVZX 指令 [英] Compiler generates costly MOVZX instruction

查看：65 发布时间：2021/6/12 20:47:08 c++ assembly optimization profiling x86-64

本文介绍了编译器生成代价高昂的 MOVZX 指令的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我的分析器已将以下函数分析确定为热点.

My profiler has identified the following function profiling as the hotspot.

typedef unsigned short ushort;

bool isInteriorTo( const std::vector<ushort>& point , const ushort* coord , const ushort dim )
{
    for( unsigned i = 0; i < dim; ++i )
    {
        if( point[i + 1] >= coord[i] ) return false;
    }

    return true;  
}

特别是一个汇编指令 MOVZX(零扩展移动) 负责大部分运行时.if语句编译成

In particular one assembly instruction MOVZX (Move with Zero-Extend) is responsible for the bulk of the runtime. The if statement is compiled into

mov     rcx, QWORD PTR [rdi]
lea     r8d, [rax+1]
add     rsi, 2
movzx   r9d, WORD PTR [rsi-2]
mov     rax, r8
cmp     WORD PTR [rcx+r8*2], r9w
jae     .L5

我想劝说编译器不要生成这条指令，但我想我首先需要了解为什么会生成这条指令.考虑到我正在使用相同的数据类型，为什么要加宽/零扩展?

I'd like to coax the compiler out of generating this instruction but I suppose I first need to understand why this instruction is generated. Why the widening/zero extension, considering that I'm working with the same data type?

(在 godbolt 编译器资源管理器上查找整个函数.)

(Find the entire function on godbolt compiler explorer.)

编译器生成代价高昂的 MOVZX 指令 [英] Compiler generates costly MOVZX instruction

问题描述

推荐答案

相关文章

C/C++开发最新文章

热门教程

热门工具

登录关闭

编译器生成代价高昂的 MOVZX 指令 [英] Compiler generates costly MOVZX instruction

问题描述

推荐答案

相关文章

C/C++开发最新文章

热门教程

热门工具

登录 关闭

登录关闭