简体   繁体   中英

Automatic audio sprite generator for web/javascript?

Audio sprites (several audio bites concatenated in one audio file) are getting common in javascript control of audio on the web. However, it takes quite a lot of "stupid" work to create and implement an audio sprite. Is there a tool or method by which you could do it automatically instead of "manually"?

Eg given a folder with audio files, I want a tool that generates

  1. An audio file with all the contents, preferably separated by a bit of silence.
  2. The onset and offset timings (in milliseconds) of each soundbite in the audiofile. Preferably, it would output the javascript sprite code itself!

Check out this excellent Node.js solution:

https://github.com/tonistiigi/audiosprite

Worked great for me!

This python script is a no-brainer way to generate audio sprites and appropriate javascript code for the web.

Features

  • No special modules need to be installed.
  • Concatenate audio files to a sprite, padded with silence
  • Generate javascript sprite information (timing)

Requirements:

  1. Only works on .wav as input and output. Convert to other file formats afterwards.
  2. All audio files should have identical properties (samplerate, channels etc.)

Python code:

silenceDuration = 0.2  # Seconds of silence between merged files
outfile = "soundsSprite.wav"  # Output file. Will be saved in the path below.
folder = "C:/Users/Jonas/Dropbox/Documents/cnru/programmering/html5 nback/stimuli_mess/audioletters/"

# Prepare...
import wave, os, glob
os.chdir(folder)
currentTime = 0
sprite = {}

# Open output file
output = wave.open(outfile, 'wb')

# Loop through files in folder and append to outfile
for i, infile in enumerate(glob.glob('*.wav')):
    # Skip the outfile itself
    if infile == outfile: continue

    # Open file and get info
    w = wave.open(folder + infile, 'rb')
    soundDuration = w.getnframes() / float(w.getframerate())

    # First file: determine general parameters- Create silence.
    if i == 0:
        output.setparams(w.getparams())
        silenceData = [0] * int(w.getframerate() * 2 * silenceDuration)  # N 0's where N are the number of samples corresponding to the duration specified in "silenceDuration"
        silenceFrames = "".join(wave.struct.pack('h', item) for item in silenceData)

    # Output sound + silence to file
    output.writeframes(w.readframes(w.getnframes()))
    output.writeframes(silenceFrames)
    w.close()

    # Create sprite metadata {'mysound.wav': [start_secs, end_secs]}. Then increment current time
    start = round(currentTime, 3)
    end = round(currentTime + soundDuration, 3)
    sprite[infile[:-4]] = [start, end]
    currentTime += soundDuration + silenceDuration

# Yay, the worst is behind us. Close output file
output.close()

# Output in the required format. Here for jquery.mb.audio
for filename, times in sprite.items():
    print '%s: {id: "%s", start: %.3f, end: %.3f, loop: false}, ' % (filename, filename, times[0], times[1])

** Output** I ran this for a number of audiofiles I had (letters read aloud) and got the following output:

Sprites variable:

{'AA': [0.449, 0.776], 'E': [3.149, 3.419], 'A': [0.0, 0.249], 'C': [2.113, 2.395], 'B': [1.554, 1.913], 'AE': [0.976, 1.354], 'D': [2.595, 2.949], 'G': [4.132, 4.554], 'F': [3.619, 3.932], 'H': [4.754, 4.972], 'K': [5.957, 6.258], 'J': [5.172, 5.757], 'L': [6.458, 6.719], 'O': [6.919, 7.133], 'Q': [8.488, 8.957], 'P': [7.853, 8.288], 'S': [9.681, 10.057], 'R': [9.157, 9.481], 'U': [10.694, 10.994], 'T': [10.257, 10.494], 'V': [11.194, 11.703], 'Y': [12.601, 12.93], 'X': [11.903, 12.401], 'Z': [13.13, 13.714], 'OE': [7.333, 7.653]}

... which was converted to jquery.mb.audio style:

AA: {id: "AA", start: 0.449, end: 0.776, loop: false}, 
E: {id: "E", start: 3.149, end: 3.419, loop: false}, 
A: {id: "A", start: 0.000, end: 0.249, loop: false}, 
C: {id: "C", start: 2.113, end: 2.395, loop: false}, 
B: {id: "B", start: 1.554, end: 1.913, loop: false}, 
AE: {id: "AE", start: 0.976, end: 1.354, loop: false}, 
D: {id: "D", start: 2.595, end: 2.949, loop: false}, 
G: {id: "G", start: 4.132, end: 4.554, loop: false}, 
F: {id: "F", start: 3.619, end: 3.932, loop: false}, 
H: {id: "H", start: 4.754, end: 4.972, loop: false}, 
K: {id: "K", start: 5.957, end: 6.258, loop: false}, 
J: {id: "J", start: 5.172, end: 5.757, loop: false}, 
L: {id: "L", start: 6.458, end: 6.719, loop: false}, 
O: {id: "O", start: 6.919, end: 7.133, loop: false}, 
Q: {id: "Q", start: 8.488, end: 8.957, loop: false}, 
P: {id: "P", start: 7.853, end: 8.288, loop: false}, 
S: {id: "S", start: 9.681, end: 10.057, loop: false}, 
R: {id: "R", start: 9.157, end: 9.481, loop: false}, 
U: {id: "U", start: 10.694, end: 10.994, loop: false}, 
T: {id: "T", start: 10.257, end: 10.494, loop: false}, 
V: {id: "V", start: 11.194, end: 11.703, loop: false}, 
Y: {id: "Y", start: 12.601, end: 12.930, loop: false}, 
X: {id: "X", start: 11.903, end: 12.401, loop: false}, 
Z: {id: "Z", start: 13.130, end: 13.714, loop: false}, 
OE: {id: "OE", start: 7.333, end: 7.653, loop: false},

Thanks to inspiration from: how to make a wav file with python and how to join wavefiles using python (tom10's answer)

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM