OpenMPI中的动态节点 [英] Dynamic nodes in OpenMPI
问题描述
在MPI中,启动后是否可以添加新节点?例如,我有两台计算机已经在运行并行MPI应用程序.我在第三台计算机上启动此应用程序的另一个实例,并将其添加到现有的通信器中.所有计算机都在本地网络中.
否,当前无法将新节点添加到正在运行的MPI应用程序中. MPI旨在了解程序启动时的节点总数.
工作正在完成(例如,在 MPI-3 上)在处理掉落的节点上.也许如果您可以向后添加故障节点,那么您可以添加新的节点,但这是我能想到的最接近的东西.有关MPI容错方法的更多信息,请参见此答案. /p>
In MPI, is it possible to add new nodes after it is started? For example, I have 2 computers already running a parallel MPI application. I start another instance of this application on a third computer and add it to the existing communicator. All computers are in a local network.
No, it's not currently possible to add new nodes to a running MPI application. MPI is designed to know the total number of nodes when the program starts.
Work is being done (on MPI-3, for example) on handling nodes that go down. Maybe if you can add faulty nodes back, then you can add new ones, but that's the closest thing I can think of. See this answer for more info on approaches to MPI fault tolerance.
这篇关于OpenMPI中的动态节点的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!