Speech-to-text for large audio files [Microsoft Speech API]
Question
What is the best way to transcribe medium-to-large audio files (about 6-10 minutes each) using the Microsoft Speech API? Is there something like batch transcription for audio files?
I used the code provided at https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-to-text-sample for continuously transcribing speech, but it stops transcribing at some point. Is there any limit on the transcription? I am only using the free trial account at the moment.
By the way, I assume there is no difference between the Bing Speech API and the new Speech service API, right?
Thanks, everyone!
Recommended answer
Thanks for your feedback.
I agree that the sample (and the documentation you are looking at) is not very clear; we will update it soon.
The sample uses RecognizeAsync, which should really be called RecognizeOnceAsync: it currently returns only the FIRST FinalResult from the service, which is why transcription appears to stop. For longer audio, you should use Start/StopRecognizeAsync instead and register to receive the Result events.
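To illustrate the start/stop pattern with result events, here is a minimal Python sketch. It assumes the current Python Speech SDK (the azure-cognitiveservices-speech package), where the answer's Start/StopRecognizeAsync appears to correspond to start_continuous_recognition and stop_continuous_recognition, and the Result event to the recognized signal. The helper names transcribe_continuously and make_file_recognizer are hypothetical, and the key, region, and file path are placeholders.

```python
import threading


def transcribe_continuously(recognizer, timeout_seconds=None):
    """Collect every final result from a recognizer until its session ends."""
    segments = []
    done = threading.Event()

    def on_recognized(evt):
        # Fired once per final result (one utterance), not only the first.
        text = evt.result.text
        if text:
            segments.append(text)

    def on_finished(evt):
        # End of input or cancellation (including errors): stop waiting.
        done.set()

    recognizer.recognized.connect(on_recognized)
    recognizer.session_stopped.connect(on_finished)
    recognizer.canceled.connect(on_finished)

    recognizer.start_continuous_recognition()
    try:
        done.wait(timeout_seconds)
    finally:
        recognizer.stop_continuous_recognition()
    return segments


def make_file_recognizer(key, region, wav_path):
    """Build a file-based recognizer (assumes the Azure Speech SDK is installed)."""
    # Imported lazily so the helper above can be exercised without the SDK.
    import azure.cognitiveservices.speech as speechsdk

    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    audio_config = speechsdk.audio.AudioConfig(filename=wav_path)
    return speechsdk.SpeechRecognizer(
        speech_config=speech_config, audio_config=audio_config
    )
```

A call might look like transcribe_continuously(make_file_recognizer("YOUR_KEY", "westus", "meeting.wav")), which returns one string per recognized utterance. Waiting on an event rather than on the first result is the point of the pattern: the session keeps running until the file has been fully processed or the service cancels it.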
Again, sorry for the poor documentation here. We will update it soon, and we will probably also rename the API in a refresh.
If you have audio files, you could also use the batch transcription feature. Perhaps that helps? https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/batch-transcription
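As a rough idea of what submitting a batch job could look like, the sketch below only builds the HTTP request and does not send it. The endpoint path (speechtotext/v3.1/transcriptions), the JSON field names, and the helper name build_batch_request are assumptions based on the v3.1 REST API and are not taken from this answer; check them against the linked documentation, since the API version may have changed.

```python
import json
import urllib.request


def build_batch_request(key, region, audio_urls, locale="en-US"):
    """Build (but do not send) a request that creates a batch transcription job."""
    # Assumption: v3.1 endpoint layout and field names; verify before use.
    url = (
        f"https://{region}.api.cognitive.microsoft.com"
        "/speechtotext/v3.1/transcriptions"
    )
    body = {
        "contentUrls": list(audio_urls),  # audio files reachable by the service
        "locale": locale,
        "displayName": "batch transcription example",
    }
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the result to urllib.request.urlopen would submit the job. Batch transcription is asynchronous, so the transcript is fetched later by polling the job rather than read from this first response.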
Cheers, Wolfgang