语音识别API [英] Speech Recognition API
问题描述
我需要自动抄写一些简短的MP3作为概念证明我工作的一部分。目前我正在进入云计算解决方案或Web API服务送MP3作为一个简单的HTTP请求和接收转录回来。
I need to automatically transcribe some short MP3s as part of a proof of concept I am working on. I am currently looking into cloud solutions or web API services to send the MP3 as a simple HTTP request and receive a transcription back.
唯一的自由/开源解决方案,我发现这里,但演示似乎不工作(至少不上的文件,我需要抄写)。我已经找到了呼叫中心的一些企业级解决方案,但到目前为止没有什么我可以简单地集成到一个项目。
The only free/open source solution I have found here, but the demos don't seem to work (at least not on the files I need to transcribe). I have found some enterprise solutions for call centers, but so far nothing I can simply integrate into a project.
是否有任何基于网络的语音识别服务可用?一个能够过滤掉噪音小者优先。
Are there any web based speech recognition services available? One that is able to filter out small noise would be a plus.
推荐答案
下面是一个非官方的方法访问谷歌ASR能力。我只是测试昨天,它仍然有效 - 你可以从16KHz的采样的音频FLC用言语JSON风格ASR输出和相关的置信度
Here is an unofficial method to access Google ASR capability. I just tested on Yesterday and it still works - you can get JSON style ASR output with words and associated confidence score from an FLC audio sampled in 16KHz.
这篇关于语音识别API的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!