Azure认知服务 - 语音SDK - Python - 推送与拉动与异步音频流 [英] Azure Cognitive Service - Speech SDK - Python - Push vs Pull vs Asynchronous Audio Stream
问题描述
执行一次性语音识别
异步来自音频文件的输入
performs one-shot speech recognition asynchronously with input from an audio file
举例说明如何使用拉动音频流识别来自自定义音频源的语音
gives an example how to use a pull audio stream to recognize speech from a custom audio source
给出一个示例如何使用
推送音频流来识别语音fr om自定义音频
gives an example how to use a push audio stream to recognize speech from a custom audio
这些是对来自此
的认知服务SDK示例中的不同函数的解释
These are the explanations for different functions given the in the Cognitive Service SDK examples from this
存储库https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py
repository https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py
你能成为吗?能够解释上述3个功能之间的区别吗? :)
Would you please be able to explain the difference between the 3 above functions ? :)
如果您能指出用于演示python程序的函数,我将不胜感激
I would also be grateful if you can point out which function to use to demonstrate a python program that
使用笔记本电脑的麦克风识别言语。非常感谢
uses a laptop's microphone to recognize speech. Many Thanks
推荐答案
Hello Mohamed,
Hello Mohamed,
这是一个快速入门 示例可以帮助您。我认为你提到的文件已经有一个示例方法来记录来自麦克风的
语音。
Here is a quickstart sample from azure cognitive services that can help you. I think the file you mentioned already has an example method to record voice from microphone.
我不是真正的不同音频流方法的专家但是根据一个
搜索我发现这是差异。
I am not really an expert on the different audio streaming method but according to one search I found this as the difference.
单词Pull and Push主要用于Windows Media Streaming。推送是将编码器程序的数据发送到Windows Media Streaming服务器,您需要用户考虑到这一点。
The words Pull and Push are mostly used with Windows Media Streaming. Push is sending data of an encoder program to a Windows Media Streaming server and you need a users account for that.
具有公共IP地址和端口号(tcp& udp)的Windows Media Server拉取信号。
Pull is something that the Windows Media Server with a public ip address and port number (tcp & udp) pulls the signal.
这篇关于Azure认知服务 - 语音SDK - Python - 推送与拉动与异步音频流的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!