sof/tools/tune/mfcc
Seppo Ingalsuo b1c996b21b Tools: Tune: MFCC: Fix channels handling in audio feature plotter
In test topologies the MFCC data can be packed to 1, 2, or 4
channels stream. This change fixes the shown time scale for
audio features 3D plot.

Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>
2024-08-21 14:39:05 +01:00
..
README.txt
decode_ceps.m Tools: Tune: MFCC: Fix channels handling in audio feature plotter 2024-08-21 14:39:05 +01:00
run_mfcc.sh
setup_mfcc.m

README.txt

This directory contains a tool to create configuration blob for SOF
MFCC component. It's simply run in Matlab or Octave with command
"setup_mfcc". The MFCC configuration parameters can be edited from the
script.

The configuration can be test run with testbench. First the test topologies
need to be created with "scripts/build-tools.sh -t". Next the testbench
is build with "scripts/rebuild-testbench.sh".

Once the previous steps are done, a sample wav file can be processed
into stream of cepstral coefficients with script run_mfcc.sh. E.g.
next command processes an ALSA test file with speech clip "front center".
The output file is hard-coded to mfcc.raw.

./run_mfcc.sh /usr/share/sounds/alsa/Front_Center.wav

The output can be plotted and retrieved with Matlab or Octave command:

[ceps, t, n] = decode_ceps('mfcc.raw', 13);

In the above it's known from configuration script that MFCC was set up to
output 13 cepstral coefficients from each FFT -> Mel -> DCT -> Cepstral
coefficients computation run.

Other kind of signals have quite big visual difference in audio features. Try
e.g. other sound files found in computer.

./run_mfcc.sh /usr/share/sounds/gnome/default/alerts/bark.ogg
./run_mfcc.sh /usr/share/sounds/gnome/default/alerts/sonar.ogg