Speech Recognition API


Question

I need to automatically transcribe some short MP3s as part of a proof of concept I am working on. I am currently looking into cloud solutions or web API services where I can send the MP3 in a simple HTTP request and receive a transcription back.

The only free/open source solution I have found is here, but the demos don't seem to work (at least not on the files I need to transcribe). I have also found some enterprise solutions for call centers, but so far nothing I can simply integrate into a project.

Are there any web-based speech recognition services available? One that can filter out minor background noise would be a plus.

Recommended answer

Here is an unofficial method to access Google's ASR capability. I tested it yesterday and it still works; from FLAC audio sampled at 16 kHz you can get JSON-style ASR output with the recognized words and their associated confidence scores.
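The request/response flow described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the endpoint URL, the `Content-Type` value, and the JSON field names (`hypotheses`, `utterance`, `confidence`) are hypothetical placeholders, since the answer does not specify them, and the function names are invented for the example. An MP3 would first need to be converted to 16 kHz FLAC with a separate tool.

```python
import json
import urllib.request

# Hypothetical endpoint: the answer does not give the real URL.
ASR_URL = "https://asr.example.com/recognize"


def build_request(flac_bytes, sample_rate=16000, url=ASR_URL):
    """Build a POST request whose body is the raw FLAC audio."""
    return urllib.request.Request(
        url,
        data=flac_bytes,
        # Assumed header format; a real service may expect something else.
        headers={"Content-Type": f"audio/x-flac; rate={sample_rate}"},
        method="POST",
    )


def parse_hypotheses(body):
    """Extract (text, confidence) pairs from an assumed JSON layout."""
    payload = json.loads(body)
    return [
        (h.get("utterance", ""), h.get("confidence"))
        for h in payload.get("hypotheses", [])
    ]


def transcribe(path):
    """Send one FLAC file and return the parsed hypotheses."""
    with open(path, "rb") as f:
        request = build_request(f.read())
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_hypotheses(response.read().decode("utf-8"))
```

Keeping request construction and response parsing in separate functions makes it possible to check both without network access, and to swap in whatever URL and response layout the chosen service actually uses.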
