Azure认知服务 - 语音SDK - Python - 推送与拉动与异步音频流 [英] Azure Cognitive Service - Speech SDK - Python - Push vs Pull vs Asynchronous Audio Stream

查看:195
本文介绍了Azure认知服务 - 语音SDK - Python - 推送与拉动与异步音频流的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

执行一次性语音识别
异步来自音频文件的输入

performs one-shot speech recognition asynchronously with input from an audio file

举例说明如何使用拉动音频流识别来自自定义音频源的语音

gives an example how to use a pull audio stream to recognize speech from a custom audio source

给出一个示例如何使用
推送音频流来识别语音fr om自定义音频

gives an example how to use a push audio stream to recognize speech from a custom audio

这些是对来自此
的认知服务SDK示例中的不同函数的解释

These are the explanations for different functions given the in the Cognitive Service SDK examples from this

存储库https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py

repository https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py

你能成为吗?能够解释上述3个功能之间的区别吗? :)

Would you please be able to explain the difference between the 3 above functions ? :)

如果您能指出用于演示python程序的函数,我将不胜感激

I would also be grateful if you can point out which function to use to demonstrate a python program that

使用笔记本电脑的麦克风识别言语。非常感谢

uses a laptop's microphone to recognize speech. Many Thanks

推荐答案

Hello Mohamed,

Hello Mohamed,

这是一个快速入门  示例可以帮助您。我认为你提到的文件已经有一个示例方法来记录来自麦克风的
语音。

Here is a quickstart sample from azure cognitive services that can help you. I think the file you mentioned already has an example method to record voice from microphone.

我不是真正的不同音频流方法的专家但是根据一个
搜索我发现这是差异。

I am not really an expert on the different audio streaming method but according to one search I found this as the difference.

单词Pull and Push主要用于Windows Media Streaming。推送是将编码器程序的数据发送到Windows Media Streaming服务器,您需要用户考虑到这一点。

The words Pull and Push are mostly used with Windows Media Streaming. Push is sending data of an encoder program to a Windows Media Streaming server and you need a users account for that.

具有公共IP地址和端口号(tcp& udp)的Windows Media Server拉取信号。

Pull is something that the Windows Media Server with a public ip address and port number (tcp & udp) pulls the signal.


这篇关于Azure认知服务 - 语音SDK - Python - 推送与拉动与异步音频流的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆