语音控制API - 高精度特定的词组 [英] Voice control API - high accuracy on specific phrases

查看:133
本文介绍了语音控制API - 高精度特定的词组的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有几个想法,语音控制应用程序。不幸的是,根据我从Siri的和谷歌语音操作所看到的,这项技术似乎并不十分在那里呢。即使是在一个完全安静的环境中,准确度是如此糟糕,它常常感到更容易将其输入到您的手机。

I have several ideas for voice controlled apps. Unfortunately, based on what I've seen from Siri and Google Voice Actions, the technology doesn't seem to quite be there yet. Even in a perfectly quiet environment, the accuracy is so bad, that it often feels much easier to type it into your phone.

,使任务更容易将是该系统限制一对夫妇的命令,专门选择听起来非常不同的,而不是通过声音向服务和刚刚起步的背课文的方法之一。

One way to make the task easier would be to limit the system to a couple of commands, specifically chosen to sound very different, as opposed to passing the sound to a service and just getting the text back.

所以我的要求是:

  • 在非常高的精度要求时,用有限的命令集工作
  • preferable为它工作在移动设备上,但仅适用于PC库可能是太有用
  • 离线再次preferable,但没有必要
  • 无需开源 - 许可细

有没有这样的API或软件存在?

Does such an API or software exist?

推荐答案

我最近已参与开发移动基于语法的语音识别应用的平台项目,具有以下特点:

I have been recently involved in a project developing a platform for mobile grammar-based speech recognition applications, with the following features:

  • The grammars are written in Grammatical Framework, see: http://kaljurand.github.com/Grammars/
  • The server is based on Sphinx, see: https://github.com/alumae/ruby-pocketsphinx-server
  • The server can be accessed from Android, see: https://code.google.com/p/recognizer-intent/

所有的组件都是开源的,它不应该太难自己的服务器和端口的系统设置为你的语言,因为你有声学模型用于该语言。

All the components are open source and it shouldn't be too hard to set up your own server and port the system to your language, given that you have the acoustic models for that language.

这篇关于语音控制API - 高精度特定的词组的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆