给定它们的wav文件,是否可以比较两首相似的歌曲? [英] Is it possible to compare two similar songs given their wav files?

查看:78
本文介绍了给定它们的wav文件,是否可以比较两首相似的歌曲?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有一个大型的较旧音乐库(1920年代,30年代,40年代等),其中有很多重复音乐,我想确定这些重复音乐,并使用相同的MP3标签信息进行整理.由于音乐是前一段时间录制的,尽管它们听起来听起来像人耳,但其录制实际上可能有所不同(更安静,更静态等等).

I have a large library of older music (1920s, 30s, 40s, etc), with a lot of duplicates, and I'd like to determine the duplicates and organize them with the same MP3 tag information. Since the music was recorded a while ago, while they may sound similar to a human ear, their recording might actually be a bit different (quieter, more static, etc).

我目前正在使用pydub解析某些音乐,并且能够生成wav文件,消除歌曲开头和结尾的静音,并压缩音乐的动态范围,但是我想希望能够比较wav文件,以便如果它们足够相似,那么我可以假定它们是相同的,并赋予它们相同的标签.

I'm currently parsing some of the music using pydub and am able to generate the wav file, remove the silence at the beginning and end of the songs, and compress the dynamic range of the music, but I'd like to be able to compare the wav files so that if they are similar enough, I can assume they are the same and give them the same tags.

是否可以通过诸如scipynumpy的方式运行wav文件数据,以使用傅里叶变换/FFT之类的方式对数据进行良好的比较/关联?我知道可以使用 dejavu 这样的系统来完成,但是它非常密集并且使用很多存储在数据库中,并且我可以访问原始文件,而不是使用麦克风,因此我希望做一些更简单的事情.

Would it be possible to run the wav file data through something like scipy and numpy to compare/correlate the data with good accuracy, using something like a Fourier Transform/FFT? I know that it can be done with a system like dejavu but it's pretty intensive and uses a lot of storage in a database, and I have access to the raw files rather than using a microphone, so I'd prefer to do something that's simpler.

推荐答案

您需要音频哈希声学指纹 .它们都是密集型"的(需要大量资源),因为它们必须解压缩音频并提取数据.

You need audio hash or acoustic fingerprint. All of them are "intensive" (resource-hungry) as they have to unpack audio and extract data.

这篇关于给定它们的wav文件,是否可以比较两首相似的歌曲?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆