如何从语音文件中识别短语 [英] How to recognize a phrase from a voice file

查看:42
本文介绍了如何从语音文件中识别短语的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

如何使引擎成功识别语音文件(wav/mp3/etc.)中的短语?

例如,如果我要有一个语音文件和同一个文件上下文的书面文本,那么要使其能够识别语音文件中的书面文字.

我尝试使用SpeechRecognitionEngine,但到目前为止没有成功.

我会很感激想法,因为这是我第一次处理语音识别技术.

我已经看到了使用字典进行语音转文本的示例,但是我不确定它在这里有什么用.我当时正在考虑将所有语音文件转换为文本,然后只是在该文本中查找特定的短语,但是我认为这不是正确的方法.将例如5个小时的语音转换为文本似乎没有任何意义,或者将特定的短语用作词典"并在语音文件中查找此项似乎并不合理.

解决方案

似乎您需要在长文件中查找特定单词.此技术称为关键字发现",它与语音识别有很大不同,但效率更高.显然,您无需转录整个文件即可搜索其中的单词,因此可以快速浏览整个文件.Microsoft语音识别引擎对关键字发现的支持非常有限.

CMUSphinx 这样的开源引擎可以用来有效地实现关键字发现.有关更多参考,请参见如何使用Pocketsphinx实现唤醒监听的信息../p>

有关基础算法的更多信息,请参见在应用程序的语音中发现声学关键字进行数据挖掘

How to get the engine to successfully recognize a phrase from a voice file (wav/mp3/etc..)?

For example, if I'll have a voice file and a written text of the context of the same file, so to make it recognize the written words in the voice file.

I tried to play around with the SpeechRecognitionEngine, but without success so far.

I'll appreciate ideas, since this is my first time dealing with Speech Recognition techniques.

I've seen examples of speech-to-text using dictionaries, but I'm not sure how it can be useful here. I was thinking of maybe converting the all voice file to text, and then simply look for the specific phrase in that text, but I don't think it's the right way. Doesn't seem to make sense to convert for example 5hrs voice to text.... or maybe to use the specific phrase as a "dictionary" and to look for this item in the voice file.

解决方案

It seems you need to look for a specific word in a long file. This technique is called "Keyword Spotting", it's quite different from speech recognition, way more efficient. Obviosly you do not need to transcribe the whole file to search a word in it, you can quickly scan through the file. Microsoft Speech Recognition engine have very limited support for keyword spotting.

Open source engines like CMUSphinx could be used to implement the keyword spotting efficiently. See for the further references the information on how to implement wake-up listening with pocketsphinx.

For the more information on the underlying algorithms see ACOUSTIC KEYWORD SPOTTING IN SPEECH WITH APPLICATIONs TO DATA MINING

这篇关于如何从语音文件中识别短语的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆