任何人都可以提出用于音频模式识别库? [英] Can anyone suggest a library for audio pattern recognition?

查看:253
本文介绍了任何人都可以提出用于音频模式识别库?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我的问题描述:

我有个样本的音频文件,并需要找到它在另一个音频文件(例如,以找到起始和在音频文件结束识别片段的时间)。是否有任何库执行此任务?

I have a sample audio file and need to find it in another audio file (for example, to find starting and ending time of recognition fragment in the audio file). Is there any library for performing this task?

推荐答案

1) CMU狮身人面像。这是一个开源工具包用于语音识别。

1) CMU Sphinx. It is a Open Source Toolkit For Speech Recognition.

说明:CMUSphinx是在BSD风格的许可证发布了一个独立扬声器大词汇量连续语音识别。这也是开源的工具和资源的集合,可以让研究人员和开发人员构建语音识别系统。

Description : CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

2) RWTH ASR (短RASR)是一个开放源语音识别工具包。

2) RWTH ASR (short RASR) is an open source speech recognition toolkit.

说明:该工具包包括艺术语音识别技术的状态自动语音识别系统的发展。它已经被人类语言技术与模式识别集团开发的亚琛工业大学。

Description : The toolkit includes state of the art speech recognition technology for the development of automatic speech recognition systems. It has been developed by the Human Language Technology and Pattern Recognition Group at RWTH Aachen University.

3)朱利叶斯:朱利叶斯是高性能,双通大词汇量连续语音识别(LVCSR)德codeR软件进行语音相关的研究和开发人员。

3) Julius : "Julius" is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder software for speech-related researchers and developers.

可能是谷歌提供更多的结果,但我认为以上三个都绰绰有余了。

May be google provide more results, but i think above three are more than sufficient.

这篇关于任何人都可以提出用于音频模式识别库?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆