sof/mfcc at 1e5b284ea2e6cc19972176d89ec3d7845260207f - sof

History

Seppo Ingalsuo b1c996b21b Tools: Tune: MFCC: Fix channels handling in audio feature plotter In test topologies the MFCC data can be packed to 1, 2, or 4 channels stream. This change fixes the shown time scale for audio features 3D plot. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>		2024-08-21 14:39:05 +01:00
..
README.txt	…
decode_ceps.m	Tools: Tune: MFCC: Fix channels handling in audio feature plotter	2024-08-21 14:39:05 +01:00
run_mfcc.sh	…
setup_mfcc.m	…

README.txt

This directory contains a tool to create configuration blob for SOF
MFCC component. It's simply run in Matlab or Octave with command
"setup_mfcc". The MFCC configuration parameters can be edited from the
script.

The configuration can be test run with testbench. First the test topologies
need to be created with "scripts/build-tools.sh -t". Next the testbench
is build with "scripts/rebuild-testbench.sh".

Once the previous steps are done, a sample wav file can be processed
into stream of cepstral coefficients with script run_mfcc.sh. E.g.
next command processes an ALSA test file with speech clip "front center".
The output file is hard-coded to mfcc.raw.

./run_mfcc.sh /usr/share/sounds/alsa/Front_Center.wav

The output can be plotted and retrieved with Matlab or Octave command:

[ceps, t, n] = decode_ceps('mfcc.raw', 13);

In the above it's known from configuration script that MFCC was set up to
output 13 cepstral coefficients from each FFT -> Mel -> DCT -> Cepstral
coefficients computation run.

Other kind of signals have quite big visual difference in audio features. Try
e.g. other sound files found in computer.

./run_mfcc.sh /usr/share/sounds/gnome/default/alerts/bark.ogg
./run_mfcc.sh /usr/share/sounds/gnome/default/alerts/sonar.ogg