如何减少Sphinx4应用程序中的噪声 [英] How to Reduce Noise in Sphinx4 application

查看:104
本文介绍了如何减少Sphinx4应用程序中的噪声的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我对Sphinx4和语音识别系统是陌生的.我正在使用sphinx4创建语音应用程序.问题是噪声,即使没有用户的语音输入也导致程序识别,即即使用户没有语音输入也将语音转换为文本,从而影响准确性.

I am new to sphinx4 and speech recognition thing. I am creating a speech application by using sphinx4. The issue is noise, which results in recognition by program even without speech input by user ie it is converting speech to text even when there is no speech input by user, thus affecting accuracy.

主要问题是如何实现降噪.这是因为即使我对麦克风什么也没说,系统仍会检测到输入.所以我想这是因为噪音.

Main issue is how to implement noise reduction. This is because the system detects input even when I dont speak anything into the microphone. So I guess its because of noise.

我在网上检查了降噪功能,但几乎没有清楚的信息.尽管Internet上有一些有关sphinx4附带的名为Denoise.java的文件的信息.但是在sphinx4-1.06中不存在.

I checked online for noise reduction but there is hardly clear information about the same. Though there is some information over internet about file named Denoise.java which comes with sphinx4. But it is not there in sphinx4-1.06.

另一个文件是WienerFilter.java,wienerfilter是用于嘈杂信号的过滤器类型.但是没有使用或实现该文件的说明.

Another file is WienerFilter.java, wienerfilter is type of filter used for noisy signals. But no instructions for using or implementing that file.

我已经在hello.gram中添加了几个单词,hello.gram是程序的语法文件.由Imtool生成的这些多余单词的语音表示形式已添加到字典中.

I have already added few more words to hello.gram which is the grammar file for the program. The phonetic representation of those extra words, generated by Imtool, has been added to the dictionary.

我正在使用eclipse和sphinx4-1.0beta6

I am using eclipse and sphinx4-1.0beta6

尽管在SPHINX4中如何激活噪声取消"堆栈溢出有一个问题,但尚未得到答案

Though there is one question on stack overflow on "HOW TO ACITIVATE NOISE CANCELLATION " IN SPHINX4 but that has not been answered yet

推荐答案

在最新版本的sphinx4-5prealpha中,默认情况下启用了具有频谱减法的静态噪声消除功能.您无需执行任何特殊操作,只需使用最新版本即可.

Static noise cancellation with spectral subtraction is enabled by default in latest version sphinx4-5prealpha. You do not need to do anything special, just use the latest version.

按照教程进行:

http://cmusphinx.sourceforge.net/wiki/tutorialsphinx4

如果您使用的是语法解码,而不是使用语言模型的大词汇量解码,则最新版本的多余语音将被忽略.如果使用语法解码,则应忽略语法中的所有单词.对于不在语法中的单词,应返回一个特殊单词<unk>.

Extra speech is ignored in latest version if you are using grammar decoding, not large vocabulary decoding with language model. If you use grammar decoding it should ignore all the words in the grammar. For words not in the grammar it should return a special word <unk>.

精度调试是一个复杂的过程,需要测试记录才能重现精度问题.没有测试记录,很难建议您如何提高准确性.除了测试记录外,您还需要提供用于解码的模型和其他信息以重现结果.

Accuracy debugging is a complex process and requires a test recording to reproduce accuracy problems. Without test recording it is hard to suggest you how to improve accuracy. Beside test recording you need to provide models you use in decoding and other information to reproduce your results.

这篇关于如何减少Sphinx4应用程序中的噪声的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆