Speech-to-text large audio files [Microsoft Speech API]

Question

What is the best way to transcribe medium/large audio files (about 6-10 minutes each) using the Microsoft Speech API? Is there something like batch transcription of audio files?

I used the code provided at https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-to-text-sample to transcribe speech continuously, but it stops transcribing at some point. Is there any restriction on the transcription? I am only using the free trial account at the moment.

By the way, I assume there is no difference between the Bing Speech API and the new Speech service API, right?

Thanks, everyone!

Recommended answer

Thanks for your feedback.

I agree that the sample (and the documentation you are looking at) is not very clear; we will update it soon.

The sample uses RecognizeAsync, which should really be called RecognizeOnceAsync: it currently returns only the FIRST FinalResult from the service, which is why transcription appears to stop. You should use Start/StopRecognizeAsync instead, and register to receive Result events.
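
As a rough illustration of the start/stop-plus-events pattern described above, here is a minimal Python sketch. It assumes the azure-cognitiveservices-speech package, whose method and event names (start_continuous_recognition, stop_continuous_recognition, recognized, session_stopped, canceled) differ from the early SDK names mentioned in this answer. The function names make_file_recognizer and transcribe_continuously, and the key, region, and wav_path parameters, are hypothetical placeholders for illustration, not part of the original sample.

```python
import threading


def make_file_recognizer(key, region, wav_path):
    """Build a recognizer that reads from a WAV file.

    Assumes the azure-cognitiveservices-speech package is installed;
    key, region and wav_path are placeholders you must supply.
    """
    import azure.cognitiveservices.speech as speechsdk  # imported lazily

    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    audio_config = speechsdk.audio.AudioConfig(filename=wav_path)
    return speechsdk.SpeechRecognizer(speech_config=speech_config,
                                      audio_config=audio_config)


def transcribe_continuously(recognizer, timeout=None):
    """Collect every final result until the session stops or is canceled."""
    segments = []
    done = threading.Event()

    def on_recognized(evt):
        # Fired once per finalized utterance, not only for the first one.
        if evt.result.text:
            segments.append(evt.result.text)

    def on_finished(evt):
        # End of the audio file, or an error/cancellation.
        done.set()

    # Register handlers before starting so no event is missed.
    recognizer.recognized.connect(on_recognized)
    recognizer.session_stopped.connect(on_finished)
    recognizer.canceled.connect(on_finished)

    recognizer.start_continuous_recognition()
    done.wait(timeout)  # block until finished (or until timeout seconds pass)
    recognizer.stop_continuous_recognition()
    return segments
```

With real credentials this might be called as transcribe_continuously(make_file_recognizer("<key>", "<region>", "audio.wav")), joining the returned segments to get the full transcript. Waiting on a threading.Event keeps the caller blocked until the service signals the end of the file, rather than returning after the first result.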

Again, sorry for the poor documentation here. We will update it soon, and will probably also rename the API in a refresh.

If you have audio files, you could also use the batch transcription feature. Perhaps that helps? https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/batch-transcription

Cheers, Wolfgang
