文本到语音(语音生成)和语音到文本(语音识别)API? [英] Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

查看：57 发布时间：2021/9/6 19:46:42 speech-recognition text-to-speech speech-to-text speech-synthesis

本文介绍了文本到语音(语音生成)和语音到文本(语音识别)API?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

是否有针对桌面或浏览器环境的已知 API 的完整列表?

Is there a comprehensive list of known APIs for desktop or browser environments?

推荐答案

我将重新整理并更新 C、Java 或 PHP 中的语音识别?.这绝不是全面的，但它可能是您的一个开始

I'll rehash and update an answer from Speech recognition in C or Java or PHP?. This is by no means comprehensive, but it might be a start for you

通过几个月的观察这些问题，我看到大多数开发者的选择都是这样分解的:

From watching these questions for few months, I've seen most developer choices break down like this:

Windows 用户 - 使用 .Net 或 Microsoft.Speech 的 System.Speech 功能并安装 Microsoft 提供的免费识别器.Windows 7 包括一个完整的语音引擎.其他的可以免费下载.有一个称为 SAPI 的相同引擎的 C++ API.请参阅 http://msdn.microsoft.com/en-us/magazine/cc163663.aspx.或 http://msdn.microsoft.com/en-us/library/ms723627(v=vs.85).aspx.有关适用于 Windows 的 Microsoft 引擎的更多背景信息System.Speech 有什么区别.Recognition 和 Microsoft.Speech.Recognition?

Windows folks - use the System.Speech features of .Net or Microsoft.Speech and install the free recognizers Microsoft provides. Windows 7 includes a full speech engine. Others are downloadable for free. There is a C++ API to the same engines known as SAPI. See at http://msdn.microsoft.com/en-us/magazine/cc163663.aspx. or http://msdn.microsoft.com/en-us/library/ms723627(v=vs.85).aspx. More background on Microsoft engines for Windows What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?

Linux 人 - Sphinx 似乎有很多追随者.请参阅 http://cmusphinx.sourceforge.net/ 和 http://cmusphinx.sourceforge.net/wiki/

Linux folks - Sphinx seems to have a good following. See http://cmusphinx.sourceforge.net/ and http://cmusphinx.sourceforge.net/wiki/

商业产品 - Nuance、Loquendo, AT&T、IBM、其他.每个都为各种语言提供自己的 SDK 和库.

Commercial products - Nuance, Loquendo, AT&T, IBM, others. Each provide their own SDKs and libraries for various languages.

在线服务 - Nuance、Yapme、ispeech.org、vlingo、其他.Nuance 改进了他们的开发者计划，现在您可以免费访问他们的服务以进行开发.Yap(我相信)最近由亚马逊购买，所以我们可能会在那里看到一些变化.

Online service - Nuance, Yapme, ispeech.org, vlingo, others. Nuance has improved their developer program and will now give you free access to their services for development. Yap (I believe) was recently purchased by Amazon, so we may see some changes there.

当然这也可能有帮助 - http://en.wikipedia.org/wiki/List_of_speech_recognition_software

Of course this may also be helpful - http://en.wikipedia.org/wiki/List_of_speech_recognition_software

有一个 Java 语音 API.请参阅 Java Speech API 中的 javax.speech.recognition http://java.sun.com/products/java-media/speech/forDevelopers/jsapi-guide/Recognition.html.相信你还是要找一个支持这个API的语音引擎.我不认为 Sphinx 完全支持它 - http://cmusphinx.sourceforge.net/sphinx4/doc/Sphinx4-faq.html#support_jsapi

There is a Java speech API. See javax.speech.recognition in the Java Speech API http://java.sun.com/products/java-media/speech/forDevelopers/jsapi-guide/Recognition.html. I believe you still have to find a speech engine that supports this API. I don't think Sphinx fully supports it - http://cmusphinx.sourceforge.net/sphinx4/doc/Sphinx4-faq.html#support_jsapi

还有很多其他问题:需要适用于 Linux 的文本转语音和语音识别工具和 pyspeech (python) - 转录 mp3 文件? 讨论 http://code.google.com/p/pyspeech/.您可能还想查看 http://code.google.com/p/dragonfly/

There are lots of other SO quesitons: Need text to speech and speech recognition tools for Linux and pyspeech (python) - Transcribe mp3 files? which talks about http://code.google.com/p/pyspeech/. You may also want to look at http://code.google.com/p/dragonfly/

这篇关于文本到语音(语音生成)和语音到文本(语音识别)API?的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

文本到语音(语音生成)和语音到文本(语音识别)API? [英] Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

文本到语音(语音生成)和语音到文本(语音识别)API? [英] Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭