如何使用Python正确解码.wav

Question

I am coding a basic frequency analisys of WAVE audio files, but I have trouble when it comes to convertion from WAVE frames to integer. 我正在编写WAVE音频文件的基本频率分析，但是当涉及从WAVE帧转换为整数时我遇到了麻烦。

Here is the relevant part of my code: 这是我的代码的相关部分：

import wave
track = wave.open('/some_path/my_audio.wav', 'r')

byt_depth = track.getsampwidth() #Byte depth of the file in BYTES
frame_rate = track.getframerate()
buf_size = 512

def byt_sum (word):
#convert a string of n bytes into an int in [0;8**n-1]
    return sum( (256**k)*word[k] for k in range(len(word)) )

raw_buf = track.readframes(buf_size)
'''
One frame is a string of n bytes, where n = byt_depth.
For instance, with a 24bits-encoded file, track.readframe(1) could be:
b'\xff\xfe\xfe'.
raw_buf[n] returns an int in [0;255]
'''

sample_buf = [byt_sum(raw_buf[byt_depth*k:byt_depth*(k+1)])
              - 2**(8*byt_depth-1) for k in range(buf_size)]

Problem is: when I plot sample_buf for a single sine signal, I get an alternative, wrecked sine signal . 问题是：当我为单个正弦信号绘制sample_buf时，我得到一个替代的，破坏的正弦信号。 I can't figure out why the signal overlaps udpside-down. 我无法弄清楚为什么信号与udpside-down重叠。

Any idea? 任何的想法？

PS: Since I'm French, my English is quite hesitating. PS：因为我是法国人，我的英语非常犹豫。 Feel free to edit if there are ugly mistakes. 如果有丑陋的错误，请随时编辑。

Answer 1

It might be because you need to use an unsigned value for representing the 16bit samples. 这可能是因为您需要使用无符号值来表示16位样本。 See https://en.wikipedia.org/wiki/Pulse-code_modulation 请参阅https://en.wikipedia.org/wiki/Pulse-code_modulation

Try to add 32767 to each sample. 尝试为每个样本添加32767。

Also you should use the python struct module to decode the buffer. 您还应该使用python struct模块来解码缓冲区。

import struct
buff_size = 512
# 'H' is for unsigned 16 bit integer, try 'h' also
sample_buff = struct.unpack('H'*buf_size, raw_buf)

Answer 2

The easiest way is to use a library that does the decoding for you. 最简单的方法是使用一个为您解码的库。 There are several Python libraries available, my favorite is the soundfile module: 有几个Python库可用，我最喜欢的是soundfile模块：

import soundfile as sf
signal, samplerate = sf.read('/some_path/my_audio.wav')

如何使用Python正确解码.wav

问题描述

2 个解决方案

解决方案1
3 已采纳 2015-12-22 13:57:18

解决方案2
1 2016-07-30 10:07:08

如何使用Python正确解码.wav

问题描述

2 个解决方案

解决方案1 3 已采纳 2015-12-22 13:57:18

解决方案2 1 2016-07-30 10:07:08

解决方案1
3 已采纳 2015-12-22 13:57:18

解决方案2
1 2016-07-30 10:07:08