性能启动开销:为什么执行MOV + SYS_exit的简单静态可执行文件为何会有如此多的停顿周期(和指令)? [英] Perf startup overhead: Why does a simple static executable which performs MOV + SYS_exit have so many stalled cycles (and instructions)?

查看：101 发布时间：2020/4/23 11:11:01 linux performance assembly x86-64 perf

本文介绍了性能启动开销:为什么执行MOV + SYS_exit的简单静态可执行文件为何会有如此多的停顿周期(和指令)?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我试图了解如何衡量性能，并决定编写一个非常简单的程序:

I'm trying to understand how to measure performance and decided to write the very simple program:

section .text
    global _start

_start:
    mov rax, 60
    syscall

然后我用perf stat ./bin运行了程序.令我惊讶的是stalled-cycles-frontend太高了.

And I ran the program with perf stat ./bin The thing I was surprised by is the stalled-cycles-frontend was too high.

      0.038132      task-clock (msec)         #    0.148 CPUs utilized          
             0      context-switches          #    0.000 K/sec                  
             0      cpu-migrations            #    0.000 K/sec                  
             2      page-faults               #    0.052 M/sec                  
       107,386      cycles                    #    2.816 GHz                    
        81,229      stalled-cycles-frontend   #   75.64% frontend cycles idle   
        47,654      instructions              #    0.44  insn per cycle         
                                              #    1.70  stalled cycles per insn
         8,601      branches                  #  225.559 M/sec                  
           929      branch-misses             #   10.80% of all branches        

   0.000256994 seconds time elapsed

据我了解stalled-cycles-frontend，这意味着CPU前端必须等待某些操作(例如总线事务)的结果完成.

As I understand the stalled-cycles-frontend it means that CPU frontend has to wait for the result of some operation (e.g. bus-transaction) to complete.

那么在最简单的情况下，导致CPU前端大部分时间等待的原因是什么?

So what caused CPU frontend to wait for most of the time in that simplest case?

还有2个页面错误?为什么?我没有读取任何内存页面.

And 2 page faults? Why? I read no memory pages.

性能启动开销:为什么执行MOV + SYS_exit的简单静态可执行文件为何会有如此多的停顿周期(和指令)? [英] Perf startup overhead: Why does a simple static executable which performs MOV + SYS_exit have so many stalled cycles (and instructions)?

问题描述

推荐答案

相关文章

服务器开发最新文章

热门教程

热门工具

登录关闭

性能启动开销:为什么执行MOV + SYS_exit的简单静态可执行文件为何会有如此多的停顿周期(和指令)? [英] Perf startup overhead: Why does a simple static executable which performs MOV + SYS_exit have so many stalled cycles (and instructions)?

问题描述

推荐答案

相关文章

服务器开发最新文章

热门教程

热门工具

登录 关闭

登录关闭