python是否自动将IO和CPU或内存绑定的段并行化? [英] Is python automagically parallelizing IO- and CPU- or memory-bound sections?

查看：108 发布时间：2020/5/1 10:04:00 python linux performance text-files

本文介绍了python是否自动将IO和CPU或内存绑定的段并行化?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

考虑此代码，它比上一个问题(但仍然比我的实际问题简单得多)

Consider this code, which is less toyish than the one in the previous question (but still much simpler than my real one)

import sys
data=[]

for line in open(sys.argv[1]):
    data.append(line[-1])

print data[-1]

现在，我期望更长的运行时间(我的基准文件长65150224行)，甚至可能更长.事实并非如此，它在大约2分钟内以与以前相同的速度运行！

Now, I was expecting a longer run time (my benchmark file is 65150224 lines long), possibly much longer. This was not the case, it runs in ~ 2 minutes on the same hw as before!

data.append()是否非常轻巧?我不这么认为，因此我编写了这个伪代码对其进行测试:

Is it data.append() very lightweight? I don't believe so, thus I wrote this fake code to test it:

data=[]
counter=0
string="a\n"

for counter in xrange(65150224):
    data.append(string[-1])

print data[-1]

运行时间为1.5到3分钟(运行之间存在很大差异)

This runs in 1.5 to 3 minutes (there is strong variability among runs)

为什么以前的程序不能显示3.5至5分钟?显然data.append()与IO并行发生.

Why don't I get 3.5 to 5 minutes in the former program? Obviously data.append() is happening in parallel with the IO.

这是个好消息！

但是它如何工作?它是有文件记录的功能吗?对我的代码有什么要求，我应该遵循以使其尽可能发挥作用(除了负载平衡IO和内存/CPU活动之外)?还是仅仅是普通的缓冲/缓存操作?

But how does it work? Is it a documented feature? Is there any requirement on my code that I should follow to make it works as much as possible (besides load-balancing IO and memory/CPU activities)? Or is it just plain buffering/caching in action?

同样，我将此问题标记为"linux"，因为我只对特定于linux的答案感兴趣.如果您认为值得做的话，请随时提供与操作系统无关的答案，甚至与其他操作系统无关.

Again, I tagged "linux" this question, because I'm interested only in linux-specific answers. Feel free to give OS-agnostic, or even other-OS answers, if you think it's worth doing.

python是否自动将IO和CPU或内存绑定的段并行化? [英] Is python automagically parallelizing IO- and CPU- or memory-bound sections?

问题描述

推荐答案

相关文章

服务器开发最新文章

热门教程

热门工具

登录关闭

python是否自动将IO和CPU或内存绑定的段并行化? [英] Is python automagically parallelizing IO- and CPU- or memory-bound sections?

问题描述

推荐答案

相关文章

服务器开发最新文章

热门教程

热门工具

登录 关闭

登录关闭