最快的重合矩阵 [英] Fastest coincidence matrix

查看：126 发布时间：2020/5/18 22:25:35 python numpy

本文介绍了最快的重合矩阵的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有两个数组，我想计算一个巧合的列表/数组.即，列出所有索引i，j，以便a [i] == b [j].现在这是我的代码:

I have two arrays and I want to compute a list/array of coincidences. That is, a list of all the indices i, j so that a[i] == b[j]. This is my code now:

b = np.array([3, 5, 6, 4])
a = np.array([1, 2, 3, 4])

np.array([[i, j] for i in range(a.size) for j in range(b.size) if a[i] == b[j]])

是否有更快的方法(也许是Numpy驱动)来做到这一点?

Is there a faster (maybe numpy-powered) way to do this?

方法1

一种方法是使用np.in1d-

m_a = np.in1d(a,b)
I = np.flatnonzero(m_a)
J = np.flatnonzero(np.in1d(b, a[m_a]))

样本输入，输出-

In [367]: a
Out[367]: array([1, 2, 3, 4])

In [368]: b
Out[368]: array([3, 5, 6, 4])

In [370]: I
Out[370]: array([2, 3])

In [371]: J
Out[371]: array([0, 3])

方法2

broadcasting-

I,J = np.nonzero(a[:,None] == b)

方法3

对于输入数组中没有重复项的情况，可以使用np.searchsorted.这里有两种变体-一种用于排序的a，另一种用于通用的a.

Approach #3

For the case where we have no duplicates within the input arrays, we could use np.searchsorted. There are two variants here - One for sorted a, and another for a generic a.

变体#1:用于排序的a-

idx = np.searchsorted(a, b)
idx[idx==a.size] = 0
mask = a[idx] == b
I = np.searchsorted(a,b[mask])
J = np.flatnonzero(mask)

变体#2:对于这种通用变体情况，我们需要使用a-

Variant #2 : For this generic variant case, we need to use argsort indices of a -

sidx = a.argsort()
a_sort = a[sidx]
idx = np.searchsorted(a_sort, b)
idx[idx==a.size] = 0
mask = a_sort[idx] == b
I = sidx[np.searchsorted(a_sort,b[mask])]
J = np.flatnonzero(mask)

这篇关于最快的重合矩阵的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

最快的重合矩阵 [英] Fastest coincidence matrix

问题描述

推荐答案

方法1

方法2

方法3

Approach #3

相关文章

Python最新文章

热门教程

热门工具

登录关闭

最快的重合矩阵 [英] Fastest coincidence matrix

问题描述

推荐答案

方法1

方法2

方法3

Approach #3

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭