没有互联网的iPhone中的大词汇量语音识别? [英] Large vocabulary speech recognition in iPhone without internet?

查看:125
本文介绍了没有互联网的iPhone中的大词汇量语音识别?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我使用需要字典的Openears。当我们在字典中提到这个词时,这很有用。我想转换我们说的所有单词。所以我使用Nuance的演讲来识别dragaon SDK。但它与网络服务器通信。出于安全考虑,我想避免服务器通信。是否可以将语音转换为文本,因为我们所说的所有单词都是在Windows Mobile中,而不是仅在离线模式下与服务器通信?

I used Openears which needs dictionary. It is usefull when we mention the word in dictionary. I wanted to convert all words we speak. So I used Nuance’s speech to recognition dragaon SDK. But it communicates with webserver. I want to avoid server communication because of security concerns. Is it possible to convert speech to text for all words we speak as it is in windows mobile without communicating server only in offline mode?

推荐答案

语音识别具有无限的词汇量需要非常大的计算和内存资源(千兆字节的内存),因此很难做到这一点在iPhone上的其他嵌入式设备上。 iPhone比桌面慢9倍。 iPad更容易,因为它具有更强大的CPU。

Speech recognition with unlimited vocabulary requires very big computational and memory resources (gigabytes of memory) and thus it's very hard to do that in iPhone on other embedded device. iPhone is 9 times slower than desktop. iPad is easier since it has more powerful CPU.

谷歌已经付出了巨大努力使他们的引擎脱机工作以进行听写,并且仍然倾向于将数据发送到服务器因为它更准确。

Google has put very big effort to make their engine work offline for dictation, and still it prefers to send data to the server because it is significantly more accurate.

因为在小型设备上运行的大多数解决方案使用的词汇量有限。虽然这个词汇量足够大,但你不会注意到这一点。通常500-1000字足以涵盖大多数实际情况。您可以使用OpenEars来识别这样的词汇。

Because of that most of the solutions running on small devices use limited vocabulary. Though this vocabulary can be large enough so you will not notice that. Usually 500-1000 words is enough to cover most practical situations. You can use OpenEars to recognize such vocabulary.

要训练语言模型,您需要来自您的域的文本(单词和表达)。语言模型培训在 CMUSphinx教程中进行了描述。要使用语言模型,您可以使用以下OpenEars API调用:

To train a language model you need texts from your domain (words and expressions). Language model training is described in CMUSphinx tutorial. To use language model you can use the following OpenEars API call:

- (void) changeLanguageModelToFile:     (NSString *)    languageModelPathAsString
withDictionary:     (NSString *)    dictionaryPathAsString 

参见 API参考了解更多详情。

您可以将OpenEars与此类词汇及相应语言结合使用模型,以支持您的设备的自由格式文本输入。

You can use OpenEars with such vocabulary and corresponding language model to support free form text entry for your device.

这篇关于没有互联网的iPhone中的大词汇量语音识别?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆