librosa.core.cqt¶
-
librosa.core.
cqt
(y, sr=22050, hop_length=512, fmin=None, n_bins=84, bins_per_octave=12, tuning=0.0, filter_scale=1, norm=1, sparsity=0.01, window=’hann’, scale=True, pad_mode=’reflect’)[source]¶ Compute the constant-Q transform of an audio signal.
This implementation is based on the recursive sub-sampling method described by [1].
[1] Schoerkhuber, Christian, and Anssi Klapuri. “Constant-Q transform toolbox for music processing.” 7th Sound and Music Computing Conference, Barcelona, Spain. 2010. Parameters: - y : np.ndarray [shape=(n,)]
audio time series
- sr : number > 0 [scalar]
sampling rate of y
- hop_length : int > 0 [scalar]
number of samples between successive CQT columns.
- fmin : float > 0 [scalar]
Minimum frequency. Defaults to C1 ~= 32.70 Hz
- n_bins : int > 0 [scalar]
Number of frequency bins, starting at fmin
- bins_per_octave : int > 0 [scalar]
Number of bins per octave
- tuning : None or float in [-0.5, 0.5)
Tuning offset in fractions of a bin (cents).
If None, tuning will be automatically estimated from the signal.
- filter_scale : float > 0
Filter scale factor. Small values (<1) use shorter windows for improved time resolution.
- norm : {inf, -inf, 0, float > 0}
Type of norm to use for basis function normalization. See
librosa.util.normalize
.- sparsity : float in [0, 1)
Sparsify the CQT basis by discarding up to sparsity fraction of the energy in each basis.
Set sparsity=0 to disable sparsification.
- window : str, tuple, number, or function
Window specification for the basis filters. See filters.get_window for details.
- scale : bool
If True, scale the CQT response by square-root the length of each channel’s filter. This is analogous to norm=’ortho’ in FFT.
If False, do not scale the CQT. This is analogous to norm=None in FFT.
- pad_mode : string
Padding mode for centered frame analysis.
See also:
librosa.core.stft
and np.pad.
Returns: - CQT : np.ndarray [shape=(n_bins, t), dtype=np.complex or np.float]
Constant-Q value each frequency at each time.
Raises: - ParameterError
If hop_length is not an integer multiple of 2**(n_bins / bins_per_octave)
Or if y is too short to support the frequency range of the CQT.
See also
Notes
This function caches at level 20.
Examples
Generate and plot a constant-Q power spectrum
>>> import matplotlib.pyplot as plt >>> y, sr = librosa.load(librosa.util.example_audio_file()) >>> C = librosa.cqt(y, sr=sr) >>> librosa.display.specshow(librosa.amplitude_to_db(C, ref=np.max), ... sr=sr, x_axis='time', y_axis='cqt_note') >>> plt.colorbar(format='%+2.0f dB') >>> plt.title('Constant-Q power spectrum') >>> plt.tight_layout()
Limit the frequency range
>>> C = librosa.cqt(y, sr=sr, fmin=librosa.note_to_hz('C2'), ... n_bins=60) >>> C array([[ 8.827e-04, 9.293e-04, ..., 3.133e-07, 2.942e-07], [ 1.076e-03, 1.068e-03, ..., 1.153e-06, 1.148e-06], ..., [ 1.042e-07, 4.087e-07, ..., 1.612e-07, 1.928e-07], [ 2.363e-07, 5.329e-07, ..., 1.294e-07, 1.611e-07]])
Using a higher frequency resolution
>>> C = librosa.cqt(y, sr=sr, fmin=librosa.note_to_hz('C2'), ... n_bins=60 * 2, bins_per_octave=12 * 2) >>> C array([[ 1.536e-05, 5.848e-05, ..., 3.241e-07, 2.453e-07], [ 1.856e-03, 1.854e-03, ..., 2.397e-08, 3.549e-08], ..., [ 2.034e-07, 4.245e-07, ..., 6.213e-08, 1.463e-07], [ 4.896e-08, 5.407e-07, ..., 9.176e-08, 1.051e-07]])