How to get an accurate performance measure?

Posted: 2021/6/15 19:53:22. Tags: c++ linux performance

Problem description

In our project we're trying to automatically monitor the performance of test runs, to make sure that the program's performance doesn't change significantly over time.

The problem is that there seems to be a consistent 5% variability in the measurements we get. That is, on the same machine, with the same program (no recompilation) running the same test, we get values that differ by around 5% from run to run. This is far too much for what we want to use the numbers for.

We're already excluding setup costs from the timing: from within the C++ code itself we grab the time immediately before and after running the time-critical portions, rather than timing the whole program at the OS level. We are also doing averaging and outlier exclusion. The problem is that the variability also appears to have long-term trends: replicates run right after each other cluster tightly, but an hour or two later the times are substantially different. (Unfortunately, spreading the test out over several hours is not feasible.) The tests are also being run on a dedicated machine while "nothing else" is running on it.
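To make the "averaging and outlier exclusion" step concrete, here is a minimal sketch of one way it could look. The question does not say which rule is actually used, so the trimming rule below is an assumption, and the names `trimmed_mean` and `relative_spread` are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Mean of the samples after dropping `trim_fraction` of them from each end
// of the sorted list. This trimming rule is an assumption for illustration,
// not the outlier criterion used in the question.
inline double trimmed_mean(std::vector<double> samples, double trim_fraction) {
    assert(!samples.empty());
    assert(trim_fraction >= 0.0 && trim_fraction < 0.5);
    std::sort(samples.begin(), samples.end());
    const std::size_t drop =
        static_cast<std::size_t>(samples.size() * trim_fraction);
    const auto first = samples.begin() + static_cast<std::ptrdiff_t>(drop);
    const auto last = samples.end() - static_cast<std::ptrdiff_t>(drop);
    const double sum = std::accumulate(first, last, 0.0);
    return sum / static_cast<double>(last - first);
}

// (max - min) / min over a batch of timings: a simple way to express
// run-to-run variability as a fraction, e.g. 0.05 for roughly 5%.
inline double relative_spread(const std::vector<double>& samples) {
    assert(!samples.empty());
    const auto mm = std::minmax_element(samples.begin(), samples.end());
    return (*mm.second - *mm.first) / *mm.first;
}
```

Note that trimming only helps with isolated outliers inside one batch. If every replicate in a batch shifts together, as with the hour-to-hour drift described above, the trimmed mean shifts with it.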

We're not quite sure where the timing variation is coming from, but it may have to do with the processor and the system: there are indications that the size of the variability depends on which machine the program is running on.

Does anyone have an idea where this variation is likely to be coming from, and how to remove it? The tests are running on a dedicated machine, so changing the operating system settings would be possible.

(As indicated by the tags, this is a C++ program running on an x86 Linux system, if that helps clarify things.)

In response to comments

Our current timing scheme is to use the clock() function from the C standard library, taking the difference between the values returned before and after the functions we want to test.
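As a rough sketch of the scheme just described, the helper below reads `std::clock()` immediately before and after the code under test. It also records elapsed time with `std::chrono::steady_clock`, which is an addition for comparison and not part of the setup described here: `std::clock()` reports processor time used by the program, so comparing it against wall-clock time may help show whether the run was waiting or descheduled. The names `Timing`, `time_region`, and `busy_work` are hypothetical.

```cpp
#include <chrono>
#include <ctime>

// Hypothetical holder for both measurements of one timed region.
struct Timing {
    double cpu_seconds;   // processor time, from std::clock()
    double wall_seconds;  // elapsed time, from std::chrono::steady_clock
};

// Times a single call of `work`, reading both clocks immediately
// before and after it.
template <typename F>
Timing time_region(F&& work) {
    const auto wall_start = std::chrono::steady_clock::now();
    const std::clock_t cpu_start = std::clock();

    work();

    const std::clock_t cpu_end = std::clock();
    const auto wall_end = std::chrono::steady_clock::now();

    Timing t;
    t.cpu_seconds =
        static_cast<double>(cpu_end - cpu_start) / CLOCKS_PER_SEC;
    t.wall_seconds =
        std::chrono::duration<double>(wall_end - wall_start).count();
    return t;
}

// Hypothetical stand-in for the deterministic, time-critical code under test.
inline double busy_work(int n) {
    volatile double acc = 0.0;  // volatile so the loop is not optimized away
    for (int i = 0; i < n; ++i) {
        acc = acc + i * 0.5;
    }
    return acc;
}
```

A call such as `time_region([] { busy_work(10000000); })` returns both figures for one run. If the two numbers diverge noticeably between runs, that would suggest the variation comes from outside the program's own CPU usage.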

The code we're testing should be deterministic, and shouldn't involve heavy IO.

I realize that the situation is a little hazy for a "silver bullet" answer. I guess I'm more looking for a "these are the factors that are important to consider, this is the order you probably should check them in, and here's how you go about checking each of them" type answer.
