服务器端语音识别 [英] Server-side Voice Recognition

查看:264
本文介绍了服务器端语音识别的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

任何人都知道已经托管的任何好的服务器端语音识别引擎?也就是说我想能够调用一个简单的Web API发布一些声音数据和获取文本。

Anyone know of any good server side voice recognition engines that are already hosted? I.e. I want to be able to call a simple web API posting some sound data and get text back. Doesn't have to be free - but hopefully free to experiment with.

推荐答案

有几种IVR服务可以托管整个VOIP会话(电话呼叫)作为完整的应用,而不是提供单独的服务事务àlacarte。如果你使你的程序看起来像一个VOIP呼叫,你可能能够完成这些服务。

There are several IVR services which host an entire VOIP session (telephone call) as a complete application, rather than offer individual service transactions "àla carte". If you were to make your program look like a VOIP call, you might be able to get it done with some of these services.

Voxeo 发布了免费(和低成本)IVR托管提供商列表针对开发人员有限的使用。毫不奇怪,所有都需要注册。

Voxeo published a list of free (and low cost) IVR hosting providers aimed towards developers for limited use. Not surprisingly, all will require registration.

  • VoiceGenie Developer Workshop (absorbed into Genesys)
  • Loquendo C@fé status unknown
  • Nuance Café (Bevocal) now Nuance On-Demand
  • Plum Voice Hosting now Plum DEV
  • VOICE Testcenter of the VOICE Community

另一种可能性是直接询问 Vlingo Twilio Tropo ,因为他们可能会卖给你你所需要的。

Another possibility would be to make a direct inquiries with Vlingo, Twilio, or Tropo as they might sell you exactly what you need.

AT& T宣布推出 Speech API 。您发送音频 - 它返回XML或JSON数据格式的文本。另请参见开发人员网站

AT&T has announced availability of a Speech API on . You send it audio – it returns text in XML or JSON data formats. See also developer site.

另一种可能性是 Dragon Mobile SDK ,这是针对个人开发者寻找一个API,使消费者应用程序能够使用语音和/或文字转语音功能。

Another possibility is the Dragon Mobile SDK from Nuance, which is aimed at individual developers looking for an API enabling consumer applications with speech and/or text-to-speech functionality.

似乎有几家新的供应商提供完全您要寻找的内容:语音样本,文本输出。 可编程网络上列出了以下内容:

There seem to be several new providers offering exactly what you are looking for: speech samples in, text out. The following are listed on Programmable Web:

  • iSpeech
  • SpeechAPI
  • OneTok
  • AISpeech API
  • NexiWave

另请注意, Loquendo 现在是Nuance的一部分了。

Also note that Loquendo is now part of Nuance.

AT& T的Speech API有几个有针对性的SDK Android ,iOS,PhoneGap,Titanium,W​​indows) - 其中一些托管在 GitHub 。甚至还有 Unity 3D演示的源代码

AT&T's Speech API has a few targeted SDKs (Android, iOS, PhoneGap, Titanium, Windows) - some of which are hosted on GitHub. There's even source for a Unity 3D demo.

已重新配置为 iOS / code>。

OneTok has reformulated it's offerings as an SDK for iOS and Android.

显然, Voice Genie 产品已被 Genesys ,以便能找到它的痕迹。鉴于Genesys对大企业的定位,很难知道他们是否有任何小批量或商品产品。

Apparently the Voice Genie product has been thoroughly digested by Genesys such that little trace of it can be found. Given Genesys' positioning towards large enterprises, is difficult to know if they have any small-volume or commodity offerings.

Plumvoice 似乎已经扩展了他们的产品。

Plumvoice seems to have expanded their offerings.

之前, Vlingo 现在是Nuance的一部分。

As with many before it, Vlingo is now part of Nuance.

(我尝试更新原始答案中的任何损坏的链接。)

(I've tried to update any broken links in original answer.)

保持此答案是最新的是一个Sisyphean任务。

Keeping this answer up-to-date is a Sisyphean task.

Voxeo的免费(和低成本)IVR托管提供商列表现在重新更正为 AT& T Speech API ,在充分公开的情况下,我现在在其中有重大的参与,因此,取消了我提供连接到几乎任何东西,而不会影响我的可信度。

Voxeo's list of free (and low cost) IVR hosting providers now re-derects to AT&T Speech API, which, in full disclosure, I now have material involvement with therein, and as such, disqualifies me from providing linking to pretty much anything without impugning my credibility.

也就是说,在演讲/ NLP市场中有很多玩家。勤奋。

That said, there are many players in the speech/NLP market. Do diligence.

现在 Google彻底打乱了苹果车

这篇关于服务器端语音识别的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆