Redis + Audio Analysis for near-realtime fun

Go to file

alban 9887f62202 [fix] doc and bpm work		2020-09-29 00:56:54 +02:00
.gitignore	[fix] update for librosa 0.7.2	2020-09-28 14:40:39 +02:00
README.md	[fix] doc and bpm work	2020-09-29 00:56:54 +02:00
redilysis.py	[fix] doc and bpm work	2020-09-29 00:56:54 +02:00
requirements.txt	[fix] update for librosa 0.7.2	2020-09-28 14:40:39 +02:00

README.md

(Audio Analysis | redis ) == <3

Redilysis sends audio analysis to a redis server.

The idea is to share a single audio analysis to many Visual Jockey filters, in our case for lasers.

Two modes are available, so you might need to run two processes for full analysis.

Redis Keys and Contents

Each word in bold is a key which you can query the redis server for. Ex:

$ redis-cli get spectrum_120
"[2.21, 0.56, 0.51, 0.32, 0.27, 0.21, 0.18, 0.17, 0.18, 0.23]"

rms

Mode spectrum
Type float number
Length scalar
Meaning Represents the root-mean-square -a mean value- for all frequencies between C0 and C9, e.g. between 12Hz and 8,372Hz.
Use A fairly basic information about the scene audio volume.
Example
- "0.12"
- The audio volume for the scene is pretty low.
- It is obtained by averaging the RMS of every audio frame during the capture.

spectrum_10

Mode spectrum
Type array of float numbers (0.0-10.0)
Length 10
Meaning Represents the audio volume for the 10 octaves between C0 and C9, e.g. between 12Hz and 8,372Hz.
Use A simple and useful way to get a global idea of the sound landscape.
Example
- "[2.21, 0.56, 0.51, 0.32, 0.27, 0.21, 0.18, 0.17, 0.18, 0.23]"
- The audio volume for the C4 octave is spectrum_10[4].
- That value being 0.27 is pretty low meaning almost no audio volume for that octave.
- It is calculated by averaging the volume of the octave's notes, e.g. C4, D4, D#4, E4, F4, F#4, G4, G#4, A4, A#4, B4.

spectrum_120

Mode spectrum
Type array of float numbers (0.0-10.0)
Length 120
Meaning Represents the audio volume for the 120 notes between C0 and C9, e.g. between 12Hz and 8,372Hz.
Use More detailed than spectrum_10, it allows to find the standing out notes of the audio landscape.
Example
- "[5.55, 2.61, 2.49, 1.79, 2.09, 4.35, 1.99, 1.57, 1.47, 0.77, 0.91, 0.89, 0.85, 0.56, 0.53, 0.73, 0.53, 0.46, 0.43, 0.44, 0.27, 0.45, 0.7, 0.81, 0.98, 0.7, 0.71, 0.6, 0.83, 0.51, 0.32, 0.31, 0.33, 0.24, 0.25, 0.33, 0.39, 0.43, 0.51, 0.28, 0.27, 0.25, 0.38, 0.25, 0.27, 0.3, 0.2, 0.27, 0.35, 0.29, 0.34, 0.3, 0.27, 0.27, 0.22, 0.21, 0.21, 0.29, 0.22, 0.28, 0.18, 0.19, 0.25, 0.26, 0.25, 0.24, 0.2, 0.21, 0.19, 0.18, 0.19, 0.17, 0.2, 0.17, 0.18, 0.17, 0.15, 0.17, 0.19, 0.18, 0.21, 0.16, 0.16, 0.18, 0.15, 0.13, 0.14, 0.16, 0.2, 0.17, 0.17, 0.2, 0.18, 0.16, 0.18, 0.15, 0.15, 0.16, 0.16, 0.19, 0.19, 0.19, 0.17, 0.18, 0.17, 0.19, 0.23, 0.23, 0.2, 0.23, 0.24, 0.36, 0.34, 0.23, 0.22, 0.2, 0.19, 0.18, 0.21, 0.21]"
- The audio volume for the C2 note is spectrum_10[23] (12x2 - 1).
- That value being 0.81 is average meaning there is some audio volume for that octave.

bpm

Mode bpm
Type
Length
Meaning Represents
Use
Example

bpm_sample_interval

Mode bpm
Type
Length
Meaning Represents
Example

bpm_delay

Mode bpm
Type
Length
Meaning Represents
Example

beats

Mode bpm
Type
Length
Meaning Represents
Example

Requirements and installation

python 2.7
audio card
redis server

Installation

sudo apt install python-pyaudio python
git clone https://git.interhacker.space/tmplab/redilysis.git
cd redilysis
pip install -r requirements.txt
python redilysis.py --help

Running in Spectrum Mode

python redilysis.py -m spectrum

This is the default mode.

It performs some frequency analysis (Fast Fourier Transform) to detect "energy" in the human audition bandwidths.

It will record if there is sound and at which frequencies.

It can run at sub-second frequency (100ms) with no problem.

It reports realistic data: spectrum analysis is the easy part.

Running in BPM Mode

python redilysis.py -m bpm -s 0.5

This mode is experimental.

It attempts to detect beats based on complex parameters.