Feature extraction¶
Spectral features¶
chroma_stft([y, sr, S, norm, n_fft, …]) |
Compute a chromagram from a waveform or power spectrogram. |
chroma_cqt([y, sr, C, hop_length, fmin, …]) |
Constant-Q chromagram |
chroma_cens([y, sr, C, hop_length, fmin, …]) |
Computes the chroma variant “Chroma Energy Normalized” (CENS), following [R674badebce0d-1]. |
melspectrogram([y, sr, S, n_fft, …]) |
Compute a mel-scaled spectrogram. |
mfcc([y, sr, S, n_mfcc]) |
Mel-frequency cepstral coefficients |
rmse([y, S, frame_length, hop_length, …]) |
Compute root-mean-square (RMS) energy for each frame, either from the audio samples y or from a spectrogram S. |
spectral_centroid([y, sr, S, n_fft, …]) |
Compute the spectral centroid. |
spectral_bandwidth([y, sr, S, n_fft, …]) |
Compute p’th-order spectral bandwidth: |
spectral_contrast([y, sr, S, n_fft, …]) |
Compute spectral contrast [R6ffcc01153df-1] |
spectral_flatness([y, S, n_fft, hop_length, …]) |
Compute spectral flatness |
spectral_rolloff([y, sr, S, n_fft, …]) |
Compute roll-off frequency |
poly_features([y, sr, S, n_fft, hop_length, …]) |
Get coefficients of fitting an nth-order polynomial to the columns of a spectrogram. |
tonnetz([y, sr, chroma]) |
Computes the tonal centroid features (tonnetz), following the method of [Recf246e5a035-1]. |
zero_crossing_rate(y[, frame_length, …]) |
Compute the zero-crossing rate of an audio time series. |
Rhythm features¶
tempogram([y, sr, onset_envelope, …]) |
Compute the tempogram: local autocorrelation of the onset strength envelope. |
Feature manipulation¶
delta(data[, width, order, axis, trim, mode]) |
Compute delta features: local estimate of the derivative of the input data along the selected axis. |
stack_memory(data[, n_steps, delay]) |
Short-term history embedding: vertically concatenate a data vector or matrix with delayed copies of itself. |