How to convert a .wav file to text


Question

I need to convert a .wav file to text.
Whatever can be heard in the .wav file needs to be written to a text file.
Please suggest a solution using any of the technologies above.

Recommended answer

Please see my comments to the question. This is my answer.

I also remember trying some open-source products a few years ago. Even though the acoustic input was very clean and based on the Western equal temperament system (an electronic keyboard), none of them produced satisfactory results; they could only be considered experimental efforts. I can imagine that by now some products are of near-commercial quality. (I can also imagine the reaction of a good musician: such people can usually catch fairly complex pieces of music on the fly, surpassing any conceivable tool, the way we can surpass any tools in image recognition :-).) I can believe that considerable progress is possible for a team of the best developers, well versed in acoustics, mathematics and music, working over a number of years. This is really very, very difficult. If you were at that level, you probably would not be asking such a question.



Answering after the clarification from the OP.

OK, this is called speech recognition: http://en.wikipedia.org/wiki/Speech_recognition.

With .NET, the available technology is easily accessible. You need to reference the assembly "System.Speech.dll" (from the GAC, as it is bundled with the .NET Framework; in Visual Studio, use the ".NET" tab of the "Add Reference" window).

Please see: http://msdn.microsoft.com/en-us/library/system.speech.recognition.aspx.
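To check that the assembly reference works, a minimal sketch like the following can list the speech recognizers installed on the machine. The class name ListRecognizers is only illustrative, and the output depends entirely on which recognizers Windows has installed.

```csharp
using System;
using System.Speech.Recognition; // requires a reference to System.Speech.dll

class ListRecognizers
{
    static void Main()
    {
        // Enumerate the speech recognizers installed on this machine.
        // An empty list means no recognition engine is available.
        foreach (RecognizerInfo info in SpeechRecognitionEngine.InstalledRecognizers())
        {
            Console.WriteLine("{0} ({1})", info.Name, info.Culture.Name);
        }
    }
}
```

If nothing is printed, creating a SpeechRecognitionEngine with the default constructor is likely to fail as well, so this is a cheap first check.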

However, don't get too excited. This technology works reasonably well if you are simply developing a voice-command feature for your application, with a reasonable number of distinct commands. If you want to perform free dictation, you can use the available dictation grammar: http://msdn.microsoft.com/en-us/library/system.speech.recognition.dictationgrammar.aspx.
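The two approaches described above might be sketched as follows, assuming a Windows machine with a default recognizer installed and a .wav file in a format the engine accepts. The helper Transcribe, the class name WavToText, the sample command words, and the choice of output file name are all illustrative assumptions, not part of the original answer.

```csharp
using System;
using System.IO;
using System.Speech.Recognition; // requires a reference to System.Speech.dll
using System.Text;

class WavToText
{
    // Hypothetical helper: runs synchronous recognition over a .wav file
    // with the given grammar and returns one recognized phrase per line.
    static string Transcribe(string wavPath, Grammar grammar)
    {
        using (var recognizer = new SpeechRecognitionEngine())
        {
            recognizer.LoadGrammar(grammar);
            recognizer.SetInputToWaveFile(wavPath);

            var text = new StringBuilder();
            RecognitionResult result;
            // Recognize() returns null when nothing more is recognized,
            // for example at the end of the file or after a silence timeout.
            while ((result = recognizer.Recognize()) != null)
            {
                text.AppendLine(result.Text);
            }
            return text.ToString();
        }
    }

    static void Main(string[] args)
    {
        string wavPath = args[0]; // path to the input .wav file

        // Approach 1: a small, fixed command vocabulary (sample words only).
        var commands = new Choices("open", "close", "save");
        var commandGrammar = new Grammar(new GrammarBuilder(commands));
        Console.WriteLine(Transcribe(wavPath, commandGrammar));

        // Approach 2: free dictation, written to a .txt file next to the input.
        string dictated = Transcribe(wavPath, new DictationGrammar());
        File.WriteAllText(Path.ChangeExtension(wavPath, ".txt"), dictated);
    }
}
```

Because the loop stops at the first null result, a long pause in the recording may end the transcription early; the asynchronous RecognizeAsync API with its completion events is probably a better fit for long files.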

You can do it, but the results… I would call them frustrating. Anyway, dictation technology of reasonable quality is reportedly available commercially. Maybe it will become commonplace soon enough…

—SA

