如何从Python中的麦克风获取声音输入,并即时进行处理? [英] How get sound input from microphone in python, and process it on the fly?

查看:2818
本文介绍了如何从Python中的麦克风获取声音输入,并即时进行处理?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

问候,

我正在尝试用Python编写一个程序,该程序每次在麦克风中被点击时都会打印一个字符串.当我说轻按"时,是指突然的大声喧similar或类似的声音.

I'm trying to write a program in Python which would print a string every time it gets a tap in the microphone. When I say 'tap', I mean a loud sudden noise or something similar.

我在SO中进行搜索,发现了以下信息:识别音频的音调

I searched in SO and found this post: Recognising tone of the audio

我认为PyAudio库可以满足我的需求,但是我不太确定如何使我的程序等待音频信号(实时麦克风监视),以及当我得到一个如何处理它时(我是否需要使用像上面帖子中指示的那样进行傅立叶变换?)

I think PyAudio library would fit my needs, but I'm not quite sure how to make my program wait for an audio signal (realtime microphone monitoring), and when I got one how to process it (do I need to use Fourier Transform like it was instructed in the above post)?

在此先感谢您能给我的帮助.

Thank you in advance for any help you could give me.

推荐答案

如果使用的是LINUX,则可以使用 pyALSAAUDIO . 对于Windows,我们有 PyAudio ,还有一个名为

If you are using LINUX, you can use pyALSAAUDIO. For windows, we have PyAudio and there is also a library called SoundAnalyse.

我在此处找到了一个示例

#!/usr/bin/python
## This is an example of a simple sound capture script.
##
## The script opens an ALSA pcm for sound capture. Set
## various attributes of the capture, and reads in a loop,
## Then prints the volume.
##
## To test it out, run it and shout at your microphone:

import alsaaudio, time, audioop

# Open the device in nonblocking capture mode. The last argument could
# just as well have been zero for blocking mode. Then we could have
# left out the sleep call in the bottom of the loop
inp = alsaaudio.PCM(alsaaudio.PCM_CAPTURE,alsaaudio.PCM_NONBLOCK)

# Set attributes: Mono, 8000 Hz, 16 bit little endian samples
inp.setchannels(1)
inp.setrate(8000)
inp.setformat(alsaaudio.PCM_FORMAT_S16_LE)

# The period size controls the internal number of frames per period.
# The significance of this parameter is documented in the ALSA api.
# For our purposes, it is suficcient to know that reads from the device
# will return this many frames. Each frame being 2 bytes long.
# This means that the reads below will return either 320 bytes of data
# or 0 bytes of data. The latter is possible because we are in nonblocking
# mode.
inp.setperiodsize(160)

while True:
    # Read data from device
    l,data = inp.read()
    if l:
        # Return the maximum of the absolute value of all samples in a fragment.
        print audioop.max(data, 2)
    time.sleep(.001)

这篇关于如何从Python中的麦克风获取声音输入,并即时进行处理?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆