使用广播/矢量化解决方案替换内部具有函数调用的循环 [英] Replacing for loops with function call inside with broadcasting/vectorized solution

查看：25 发布时间：2021/9/17 19:19:45 python-3.x performance numpy vectorization array-broadcasting

本文介绍了使用广播/矢量化解决方案替换内部具有函数调用的循环的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

当使用广播时，而不是广播标量来匹配数组，向量化函数由于某种原因将数组缩小为标量.

When using broadcasting, rather than broadcasting scalars to match the arrays, the vectorized function is instead, for some reason, shrinking the arrays to scalars.

下面是一个 MWE.它包含一个双循环.我无法编写不使用 for 循环的更快代码，而是使用广播/矢量化 numpy.

Below is a MWE. It contains a double for loop. I am having trouble writing faster code that does not use the for loops, but instead, uses broadcasting/vectorized numpy.

import numpy as np

def OneD(x, y, z):
    ret = np.exp(x)**(y+1) / (z+1)
    return ret 

def ThreeD(a,b,c):
    value = OneD(a[0],b[0], c)
    value *= OneD(a[1],b[1], c)
    value *= OneD(a[2],b[2], c)

    return value

M_1 = M_2 = [[0,0,0],[0,0,1], [1,1,1], [1,0,2]] 
scales0 = scales1 = [1.1, 2.2, 3.3, 4.4]
cc0 = cc1 = 1.77   
results = np.zeros((4,4))
for s0, n0, in enumerate(M_1):
    for s1, n1, in enumerate(M_2):
        v = ThreeD(n0, n1, s1)
        v *= cc0 * cc1 * scales0[s0] * scales1[s1]
        results[s0, s1] += v

虽然我想删除两个 for 循环，但为了简单起见，我首先尝试摆脱内部循环.随意回答，但两者都已删除.

While I want to remove both for loops, to keep it simple I am trying first to get rid of the inner loop. Feel free to answer with both removed however.

这是我改变循环的方式

rr = [0,1,2,3]
myfun = np.vectorize(ThreeD)
for s0, n0, in enumerate(M_1):
    #for s1, n1, in enumerate(M_2):
    v = myfun(n0, M_2, rr)
    v *= cc0 * cc1 * scales0[s0] * scales1[rr]
    results[s0, rr] += v

错误信息:

Traceback (most recent call last):                                                                                                                                               
  File "main.py", line 36, in <module>                                                                                                                                           
    v = myfun(n0, M_2, rr)                                                                                                                                                       
  File "/usr/lib/python3/dist-packages/numpy/lib/function_base.py", line 1573, in __call__                                                                                       
    return self._vectorize_call(func=func, args=vargs)                                                                                                                           
  File "/usr/lib/python3/dist-packages/numpy/lib/function_base.py", line 1633, in _vectorize_call                                                                                
    ufunc, otypes = self._get_ufunc_and_otypes(func=func, args=args)                                                                                                             
  File "/usr/lib/python3/dist-packages/numpy/lib/function_base.py", line 1597, in _get_ufunc_and_otypes                                                                          
    outputs = func(*inputs)                                                                                                                                                      
  File "main.py", line 18, in ThreeD                                                                                                                                             
    value = OneD(a[0],b[0], c)                                                                                                                                                   
IndexError: invalid index to scalar variable.

我还需要矢量化 OneD 函数吗?我希望通过对 ThreeD 函数进行矢量化，它会进行适当的记账.

Do I also need to vectorize the OneD function? I was hoping by vectorizing the ThreeD function, it would do the proper bookkeeping.

推荐答案

在你的循环中，n0 和 n1 是嵌套的 M_ 的元素> 列表，每 3 个元素.

In your loops, n0 and n1 are elements of the nested M_ lists, each 3 elements.

In [78]: ThreeD(np.arange(3),np.arange(3),3)                                                                    
Out[78]: 46.577468547527005

OneD 适用于数组，因此可以获得完整的 n 列表/数组:

OneD works with arrays, so can get the full n lists/arrays:

In [79]: OneD(np.arange(3), np.arange(3),3)                                                                     
Out[79]: array([  0.25      ,   1.84726402, 100.85719837])
In [80]: np.prod(_)                                                                                             
Out[80]: 46.577468547527005

并且产品匹配ThreeD.

只看双循环的 ThreeD 部分:

Looking just at the ThreeD part of your double loop:

In [81]: for s0, n0, in enumerate(M_1): 
    ...:     for s1, n1, in enumerate(M_2): 
    ...:         print(n0,n1,s1, ThreeD(n0, n1, s1)) 
    ...:                                                                                                        
[0, 0, 0] [0, 0, 0] 0 1.0
[0, 0, 0] [0, 0, 1] 1 0.125
[0, 0, 0] [1, 1, 1] 2 0.037037037037037035
[0, 0, 0] [1, 0, 2] 3 0.015625
[0, 0, 1] [0, 0, 0] 0 2.718281828459045
...
[1, 0, 2] [1, 0, 2] 3 46.577468547527005

从你的列表中创建数组:

Making arrays from your lists:

In [82]: M1 = np.array(M_1); M2 = np.array(M_2)                                                                 
In [83]: M1.shape                                                                                               
Out[83]: (4, 3)

我用这个广播调用复制了那些 ThreeD 结果:

I replicate those ThreeD results with this broadcasted call:

In [87]: np.prod(OneD(M1[:,None,:], M2[None,:,:], np.arange(4)[None,:,None]), axis=2)                           
Out[87]: 
array([[1.00000000e+00, 1.25000000e-01, 3.70370370e-02, 1.56250000e-02],
       [2.71828183e+00, 9.23632012e-01, 2.73668744e-01, 3.13836514e-01],
       [2.00855369e+01, 6.82476875e+00, 1.49418072e+01, 6.30357490e+00],
       [2.00855369e+01, 1.85516449e+01, 1.49418072e+01, 4.65774685e+01]])

我将 (4,1,3)、(1,4,3) 和 (1,4,1) 数组传递给 OneD.结果是 (4,4,3)，然后我在最后一个轴上乘以得到 (4,4).

I am passing (4,1,3), (1,4,3) and (1,4,1) arrays to OneD. The result is (4,4,3), which I then multiply on the last axis to make a (4,4).

剩下的计算是:

In [88]: (cc0*cc1*np.array(scales0)[:,None]*np.array(scales1)[None,:])                                          
Out[88]: 
array([[ 3.790809,  7.581618, 11.372427, 15.163236],
       [ 7.581618, 15.163236, 22.744854, 30.326472],
       [11.372427, 22.744854, 34.117281, 45.489708],
       [15.163236, 30.326472, 45.489708, 60.652944]])

In [89]: _87*_88        # multiplying these two 4x4 arrays                                                                           
Out[89]: 
array([[3.79080900e+00, 9.47702250e-01, 4.21201000e-01, 2.36925563e-01],
       [2.06089744e+01, 1.40052502e+01, 6.22455564e+00, 9.51755427e+00],
       [2.28421302e+02, 1.55228369e+02, 5.09773834e+02, 2.86747781e+02],
       [3.04561737e+02, 5.62605939e+02, 6.79698445e+02, 2.82506059e+03]])

匹配`results:

which matches `results:

In [90]: results                                                                                                
Out[90]: 
array([[3.79080900e+00, 9.47702250e-01, 4.21201000e-01, 2.36925563e-01],
       [2.06089744e+01, 1.40052502e+01, 6.22455564e+00, 9.51755427e+00],
       [2.28421302e+02, 1.55228369e+02, 5.09773834e+02, 2.86747781e+02],
       [3.04561737e+02, 5.62605939e+02, 6.79698445e+02, 2.82506059e+03]])

这篇关于使用广播/矢量化解决方案替换内部具有函数调用的循环的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

使用广播/矢量化解决方案替换内部具有函数调用的循环 [英] Replacing for loops with function call inside with broadcasting/vectorized solution

问题描述

错误信息:

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

使用广播/矢量化解决方案替换内部具有函数调用的循环 [英] Replacing for loops with function call inside with broadcasting/vectorized solution

问题描述

错误信息:

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭