如何从python中的麦克风获取声音输入，并动态处理它？

Question

问候，

我正在尝试用 Python 编写一个程序，该程序每次在麦克风中轻敲时都会打印一个字符串。 当我说“敲击”时，我的意思是突然发出很大的噪音或类似的声音。

我在 SO 中搜索并找到了这篇文章：识别音频的音调

我认为 PyAudio 库会满足我的需要，但我不太确定如何让我的程序等待音频信号（实时麦克风监控），以及当我得到一个如何处理它（我是否需要使用傅立叶变换像它是在上面的帖子中指示的）？

预先感谢您能给我的任何帮助。

Answer 1

如果您使用的是 LINUX，则可以使用pyALSAAUDIO 。 对于 Windows，我们有PyAudio ，还有一个名为SoundAnalyse的库。

我在这里找到了一个 Linux 示例：

#!/usr/bin/python
## This is an example of a simple sound capture script.
##
## The script opens an ALSA pcm for sound capture. Set
## various attributes of the capture, and reads in a loop,
## Then prints the volume.
##
## To test it out, run it and shout at your microphone:

import alsaaudio, time, audioop

# Open the device in nonblocking capture mode. The last argument could
# just as well have been zero for blocking mode. Then we could have
# left out the sleep call in the bottom of the loop
inp = alsaaudio.PCM(alsaaudio.PCM_CAPTURE,alsaaudio.PCM_NONBLOCK)

# Set attributes: Mono, 8000 Hz, 16 bit little endian samples
inp.setchannels(1)
inp.setrate(8000)
inp.setformat(alsaaudio.PCM_FORMAT_S16_LE)

# The period size controls the internal number of frames per period.
# The significance of this parameter is documented in the ALSA api.
# For our purposes, it is suficcient to know that reads from the device
# will return this many frames. Each frame being 2 bytes long.
# This means that the reads below will return either 320 bytes of data
# or 0 bytes of data. The latter is possible because we are in nonblocking
# mode.
inp.setperiodsize(160)

while True:
    # Read data from device
    l,data = inp.read()
    if l:
        # Return the maximum of the absolute value of all samples in a fragment.
        print audioop.max(data, 2)
    time.sleep(.001)

Answer 2

...当我得到一个如何处理它时（我是否需要像上一篇文章中所指示的那样使用傅立叶变换）？

如果你想要一个“抽头”，那么我认为你对幅度比频率更感兴趣。 所以傅立叶变换可能对您的特定目标没有用。 您可能想要对输入的短期（比如 10 毫秒）幅度进行运行测量，并检测它何时突然增加某个增量。 您需要调整以下参数：

什么是“短期”幅度测量
您寻找的增量增量是多少
增量变化必须以多快的速度发生

虽然我说你对频率不感兴趣，但你可能想先做一些过滤，过滤掉特别是低频和高频成分。 这可能会帮助您避免一些“误报”。 你可以用 FIR 或 IIR 数字滤波器做到这一点； 傅立叶不是必需的。

Answer 3

我知道这是一个老问题，但如果有人再次在这里查看...请参阅https://python-sounddevice.readthedocs.io/en/0.4.1/index.html 。

它有一个很好的例子“输入到输出传递”在这里https://python-sounddevice.readthedocs.io/en/0.4.1/examples.html#input-to-output-pass-through 。

……还有很多其他的例子……

如何从python中的麦克风获取声音输入，并动态处理它？

问题描述

3 个解决方案

解决方案1
42 已采纳 2009-12-20 21:10:42

解决方案2
7 2009-12-20 23:42:20

解决方案3
4 2020-11-13 16:43:05

如何从python中的麦克风获取声音输入，并动态处理它？

问题描述

3 个解决方案

解决方案1 42 已采纳 2009-12-20 21:10:42

解决方案2 7 2009-12-20 23:42:20

解决方案3 4 2020-11-13 16:43:05

解决方案1
42 已采纳 2009-12-20 21:10:42

解决方案2
7 2009-12-20 23:42:20

解决方案3
4 2020-11-13 16:43:05