优化多线程numpy数组函数 [英] Optimizing a multithreaded numpy array function

查看：370 发布时间：2020/5/14 1:02:07 python multithreading numpy

本文介绍了优化多线程numpy数组函数的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

给定2个3D点的大型数组(我将第一个称为源"，第二个称为目标")，我需要一个函数，该函数将从目标"返回的索引与源"的元素匹配，该索引与源"的元素匹配最接近的，但有以下限制:我只能使用numpy ...所以没有scipy，pandas，numexpr，cython ...

Given 2 large arrays of 3D points (I'll call the first "source", and the second "destination"), I needed a function that would return indices from "destination" which matched elements of "source" as its closest, with this limitation: I can only use numpy... So no scipy, pandas, numexpr, cython...

为此，我编写了一个函数.我遍历source的元素，找到离目标最近的元素并返回其索引.由于性能问题，又由于只能使用numpy，我尝试使用多线程来加快速度.这是线程和非线程函数，以及它们在8核计算机上的速度比较.

To do this i wrote a function based on the "brute force" answer to this question. I iterate over elements of source, find the closest element from destination and return its index. Due to performance concerns, and again because i can only use numpy, I tried multithreading to speed it up. Here are both threaded and unthreaded functions and how they compare in speed on an 8 core machine.

import timeit
import numpy as np
from numpy.core.umath_tests import inner1d
from multiprocessing.pool import ThreadPool

def threaded(sources, destinations):
    # Define worker function
    def worker(point):
        dlt = (destinations-point) # delta between destinations and given point
        d = inner1d(dlt,dlt) # get distances
        return np.argmin(d) # return closest index

    # Multithread!
    p = ThreadPool()
    return p.map(worker, sources)


def unthreaded(sources, destinations):
    results = []
    #for p in sources:
    for i in range(len(sources)):
        dlt = (destinations-sources[i]) # difference between destinations and given point
        d = inner1d(dlt,dlt) # get distances
        results.append(np.argmin(d)) # append closest index

    return results


# Setup the data
n_destinations = 10000 # 10k random destinations
n_sources = 10000      # 10k random sources
destinations= np.random.rand(n_destinations,3) * 100
sources = np.random.rand(n_sources,3) * 100

#Compare!
print 'threaded:   %s'%timeit.Timer(lambda: threaded(sources,destinations)).repeat(1,1)[0]
print 'unthreaded: %s'%timeit.Timer(lambda: unthreaded(sources,destinations)).repeat(1,1)[0]

结果:

threaded:   0.894030461056
unthreaded: 1.97295164054

多线程处理似乎是有益的，但是我希望我处理的真实数据集更大得多，因此可以增加2倍以上.

Multithreading seems beneficial but I was hoping for more than 2X increase given the real life dataset i deal with are much larger.

所有提高性能的建议(在上述限制内)将不胜感激！

All recommendations to improve performance (within the limitations described above) will be greatly appreciated!

优化多线程numpy数组函数 [英] Optimizing a multithreaded numpy array function

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

优化多线程numpy数组函数 [英] Optimizing a multithreaded numpy array function

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭