Feature extraction

Spectral features

chroma_stft([y, sr, S, norm, n_fft, …]) Compute a chromagram from a waveform or power spectrogram.
chroma_cqt([y, sr, C, hop_length, fmin, …]) Constant-Q chromagram
chroma_cens([y, sr, C, hop_length, fmin, …]) Computes the chroma variant “Chroma Energy Normalized” (CENS), following [R674badebce0d-1].
melspectrogram([y, sr, S, n_fft, …]) Compute a mel-scaled spectrogram.
mfcc([y, sr, S, n_mfcc]) Mel-frequency cepstral coefficients
rmse([y, S, frame_length, hop_length, …]) Compute root-mean-square (RMS) energy for each frame, either from the audio samples y or from a spectrogram S.
spectral_centroid([y, sr, S, n_fft, …]) Compute the spectral centroid.
spectral_bandwidth([y, sr, S, n_fft, …]) Compute p’th-order spectral bandwidth:
spectral_contrast([y, sr, S, n_fft, …]) Compute spectral contrast [R6ffcc01153df-1]
spectral_flatness([y, S, n_fft, hop_length, …]) Compute spectral flatness
spectral_rolloff([y, sr, S, n_fft, …]) Compute roll-off frequency
poly_features([y, sr, S, n_fft, hop_length, …]) Get coefficients of fitting an nth-order polynomial to the columns of a spectrogram.
tonnetz([y, sr, chroma]) Computes the tonal centroid features (tonnetz), following the method of [Recf246e5a035-1].
zero_crossing_rate(y[, frame_length, …]) Compute the zero-crossing rate of an audio time series.

Rhythm features

tempogram([y, sr, onset_envelope, …]) Compute the tempogram: local autocorrelation of the onset strength envelope.

Feature manipulation

delta(data[, width, order, axis, trim, mode]) Compute delta features: local estimate of the derivative of the input data along the selected axis.
stack_memory(data[, n_steps, delay]) Short-term history embedding: vertically concatenate a data vector or matrix with delayed copies of itself.