音乐识别和信号处理 [英] Music Recognition and Signal Processing

查看:236
本文介绍了音乐识别和信号处理的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我要的建立类似的东西来 Tunatic 或的 Midomi (试行出来,如果​​你不知道他们做什么),我想知道我不得不用什么算法;这个想法我有这样的应用程序的工作原理是这样的:

I want to build something similar to Tunatic or Midomi (try them out if you're not sure what they do) and I'm wondering what algorithms I'd have to use; The idea I have about the workings of such applications is something like this:


  1. 有一个很大的数据库,几首歌曲

  2. 在每首歌曲的 1 降低质量/比特率(以64kbps的举例),并计算了声哈希

  3. 有你想要找出音乐的声音/节选

  4. 在歌曲 3。降低质量/比特率(再次64kbps的),并计算出声音哈希

  5. 如果 4。声音哈希是任何在 2 声音散列返回匹配的音乐

  1. have a big database with several songs
  2. for each song in 1. reduce quality / bit-rate (to 64kbps for instance) and calculate the sound "hash"
  3. have the sound / excerpt of the music you want to identify
  4. for the song in 3. reduce quality / bit-rate (again to 64kbps) and calculate sound "hash"
  5. if 4. sound hash is in any of the 2. sound hashes return the matched music

我虽然降低了质量/比特率由于环境噪声和编码的差异。

I though of reducing the quality / bit-rate due to the environment noises and encoding differences.

我在正确的轨道在这里?谁能给我提供任何特定文档或例子?绿似乎甚至承认呜呜的,这是pretty赫然IM pressive!他们是怎么做到的?

Am I in the right track here? Can anyone provide me any specific documentation or examples? Midori seems to even recognize hum's, that's pretty awesomely impressive! How do they do that?

做无害哈希值存在,或者是它的东西我只是做了?如果他们这样做,我该怎么算呢?更重要的是,我怎么能检查子散列父亲散列

Do sound hashes exist or is it something I just made up? If they do, how can I calculate them? And more importantly, how can I check if child-hash is in father-hash?

我怎么会去的建筑与Python类似的系统(可能是内置模块)或PHP

一些例子(preferably在Python或PHP)将大大AP preciated。在此先感谢!

Some examples (preferably in Python or PHP) will be greatly appreciated. Thanks in advance!

推荐答案

我的工作,实现多种音乐信息检索技术的酷框架的外围。我绝对不是一个专家(编辑:其实,我无处接近的专家,我只想澄清),但我可以告诉大家,该快速傅立叶变换用于遍布这个东西的地方。傅立叶分析是古怪的,但其应用是pretty直线前进。基本上当你分析它在频域而不是时域,你可以得到很多关于音频信息。这是傅立叶分析给你。

I worked on the periphery of a cool framework that implements several Music Information Retrieval techniques. I'm hardly an expert (edit: actually i'm nowhere close to an expert, just to clarify), but I can tell that that the Fast Fourier Transform is used all over the place with this stuff. Fourier analysis is wacky but its application is pretty straight-forward. Basically you can get a lot of information about audio when you analyze it in the frequency domain rather than the time domain. This is what Fourier analysis gives you.

这可能有点题外话从你想要做什么。在任何情况下,在项目一些很酷的工具,一起玩,以及查看源代码code的核心库本身: HTTP: //marsyas.sness.net

That may be a bit off topic from what you want to do. In any case, there are some cool tools in the project to play with, as well as viewing the sourcecode for the core library itself: http://marsyas.sness.net

这篇关于音乐识别和信号处理的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆