WMA Voice decoding context. More...

Data Fields
Global values specified in the stream header / extradata or used all over.
GetBitContext	gb
	packet bitreader. More...

int8_t	vbm_tree [25]
	converts VLC codes to frame type More...

int	spillover_bitsize
	number of bits used to specify spillover_nbits in the packet header = ceil(log2(ctx->block_align << 3)) More...

int	history_nsamples
	number of samples in history for signal prediction (through ACB) More...

int	do_apf
	whether to apply the averaged projection filter (APF) More...

int	denoise_strength
	strength of denoising in Wiener filter [0-11] More...

int	denoise_tilt_corr
	Whether to apply tilt correction to the Wiener filter coefficients (postfilter) More...

int	dc_level
	Predicted amount of DC noise, based on which a DC removal filter is used. More...

int	lsps
	number of LSPs per frame [10 or 16] More...

int	lsp_q_mode
	defines quantizer defaults [0, 1] More...

int	lsp_def_mode
	defines different sets of LSP defaults [0, 1] More...

int	min_pitch_val
	base value for pitch parsing code More...

int	max_pitch_val
	max value + 1 for pitch parsing More...

int	pitch_nbits
	number of bits used to specify the pitch value in the frame header More...

int	block_pitch_nbits
	number of bits used to specify the first block's pitch value More...

int	block_pitch_range
	range of the block pitch More...

int	block_delta_pitch_nbits
	number of bits used to specify the delta pitch between this and the last block's pitch value, used in all but first block More...

int	block_delta_pitch_hrange
	1/2 range of the delta (full range is from -this to +this-1) More...

uint16_t	block_conv_table [4]
	boundaries for block pitch unit/scale conversion More...

Packet values specified in the packet header or related to a packet.
A packet is considered to be a single unit of data provided to this decoder by the demuxer.
int	spillover_nbits
	number of bits of the previous packet's last superframe preceding this packet's first full superframe (useful for re-synchronization also) More...

int	has_residual_lsps
	if set, superframes contain one set of LSPs that cover all frames, encoded as independent and residual LSPs; if not set, each frame contains its own, fully independent, LSPs More...

int	skip_bits_next
	number of bits to skip at the next call to wmavoice_decode_packet() (since they're part of the previous superframe) More...

uint8_t	sframe_cache [SFRAME_CACHE_MAXSIZE+AV_INPUT_BUFFER_PADDING_SIZE]
	cache for superframe data split over multiple packets More...

int	sframe_cache_size
	set to >0 if we have data from an (incomplete) superframe from a previous packet that spilled over in the current packet; specifies the amount of bits in sframe_cache More...

PutBitContext	pb
	bitstream writer for sframe_cache More...

Frame and superframe values
Superframe and frame data - these can change from frame to frame, although some of them do in that case serve as a cache / history for the next frame or superframe.
double	prev_lsps [MAX_LSPS]
	LSPs of the last frame of the previous superframe. More...

int	last_pitch_val
	pitch value of the previous frame More...

int	last_acb_type
	frame type [0-2] of the previous frame More...

int	pitch_diff_sh16
	((cur_pitch_val - last_pitch_val) << 16) / MAX_FRAMESIZE More...

float	silence_gain
	set for use in blocks if ACB_TYPE_NONE More...

int	aw_idx_is_ext
	whether the AW index was encoded in 8 bits (instead of 6) More...

int	aw_pulse_range
	the range over which aw_pulse_set1() can apply the pulse, relative to the value in aw_first_pulse_off. More...

int	aw_n_pulses [2]
	number of AW-pulses in each block; note that this number can be negative (in which case it basically means "zero") More...

int	aw_first_pulse_off [2]
	index of first sample to which to apply AW-pulses, or -0xff if unset More...

int	aw_next_pulse_off_cache
	the position (relative to start of the second block) at which pulses should start to be positioned, serves as a cache for pitch-adaptive window pulses between blocks More...

int	frame_cntr
	current frame index [0 - 0xFFFE]; is only used for comfort noise in pRNG() More...

int	nb_superframes
	number of superframes in current packet More...

float	gain_pred_err [6]
	cache for gain prediction More...

float	excitation_history [MAX_SIGNAL_HISTORY]
	cache of the signal of previous superframes, used as a history for signal generation More...

float	synth_history [MAX_LSPS]
	see excitation_history More...

Postfilter values
Variables used for postfilter implementation, mostly history for smoothing and so on, and context variables for FFT/iFFT.
RDFTContext	rdft

RDFTContext	irdft
	contexts for FFT-calculation in the postfilter (for denoise filter) More...

DCTContext	dct

DCTContext	dst
	contexts for phase shift (in Hilbert transform, part of postfilter) More...

float	sin [511]

float	cos [511]
	8-bit cosine/sine windows over [-pi,pi] range More...

float	postfilter_agc
	gain control memory, used in adaptive_gain_control() More...

float	dcf_mem [2]
	DC filter history. More...

float	zero_exc_pf [MAX_SIGNAL_HISTORY+MAX_SFRAMESIZE]
	zero filter output (i.e. More...

float	denoise_filter_cache [MAX_FRAMESIZE]

int	denoise_filter_cache_size
	samples in denoise_filter_cache More...

float	tilted_lpcs_pf [0x80]
	aligned buffer for LPC tilting More...

float	denoise_coeffs_pf [0x80]
	aligned buffer for denoise coefficients More...

float	synth_filter_out_buf [0x80+MAX_LSPS_ALIGN16]
	aligned buffer for postfilter speech synthesis More...

Detailed Description

WMA Voice decoding context.

Definition at line 131 of file wmavoice.c.

Field Documentation

◆ gb

GetBitContext WMAVoiceContext::gb

packet bitreader.

During decoder init, it contains the extradata from the demuxer. During decoding, it contains packet data.

Definition at line 136 of file wmavoice.c.

Referenced by parse_packet_header(), synth_superframe(), wmavoice_decode_init(), and wmavoice_decode_packet().

◆ vbm_tree

int8_t WMAVoiceContext::vbm_tree[25]

converts VLC codes to frame type

Definition at line 140 of file wmavoice.c.

Referenced by synth_frame(), and wmavoice_decode_init().

◆ spillover_bitsize

int WMAVoiceContext::spillover_bitsize

number of bits used to specify spillover_nbits in the packet header = ceil(log2(ctx->block_align << 3))

Definition at line 142 of file wmavoice.c.

Referenced by parse_packet_header(), and wmavoice_decode_init().

◆ history_nsamples

int WMAVoiceContext::history_nsamples

number of samples in history for signal prediction (through ACB)

Definition at line 145 of file wmavoice.c.

Referenced by synth_frame(), synth_superframe(), wmavoice_decode_init(), and wmavoice_flush().

◆ do_apf

int WMAVoiceContext::do_apf

whether to apply the averaged projection filter (APF)

Definition at line 149 of file wmavoice.c.

Referenced by synth_frame(), synth_superframe(), wmavoice_decode_end(), wmavoice_decode_init(), and wmavoice_flush().

◆ denoise_strength

int WMAVoiceContext::denoise_strength

strength of denoising in Wiener filter [0-11]

Definition at line 151 of file wmavoice.c.

Referenced by calc_input_response(), and wmavoice_decode_init().

◆ denoise_tilt_corr

int WMAVoiceContext::denoise_tilt_corr

Whether to apply tilt correction to the Wiener filter coefficients (postfilter)

Definition at line 153 of file wmavoice.c.

Referenced by calc_input_response(), and wmavoice_decode_init().

◆ dc_level

int WMAVoiceContext::dc_level

Predicted amount of DC noise, based on which a DC removal filter is used.

Definition at line 155 of file wmavoice.c.

Referenced by postfilter(), and wmavoice_decode_init().

◆ lsps

int WMAVoiceContext::lsps

number of LSPs per frame [10 or 16]

Definition at line 158 of file wmavoice.c.

Referenced by postfilter(), synth_block(), synth_frame(), synth_superframe(), wiener_denoise(), wmavoice_decode_init(), and wmavoice_flush().

◆ lsp_q_mode

int WMAVoiceContext::lsp_q_mode

defines quantizer defaults [0, 1]

Definition at line 159 of file wmavoice.c.

Referenced by synth_superframe(), and wmavoice_decode_init().

◆ lsp_def_mode

int WMAVoiceContext::lsp_def_mode

defines different sets of LSP defaults [0, 1]

Definition at line 160 of file wmavoice.c.

Referenced by synth_superframe(), and wmavoice_decode_init().

◆ min_pitch_val

int WMAVoiceContext::min_pitch_val

base value for pitch parsing code

Definition at line 163 of file wmavoice.c.

Referenced by kalman_smoothen(), synth_frame(), and wmavoice_decode_init().

◆ max_pitch_val

int WMAVoiceContext::max_pitch_val

max value + 1 for pitch parsing

Definition at line 164 of file wmavoice.c.

Referenced by kalman_smoothen(), synth_frame(), and wmavoice_decode_init().

◆ pitch_nbits

int WMAVoiceContext::pitch_nbits

number of bits used to specify the pitch value in the frame header

Definition at line 165 of file wmavoice.c.

Referenced by synth_frame(), and wmavoice_decode_init().

◆ block_pitch_nbits

int WMAVoiceContext::block_pitch_nbits

number of bits used to specify the first block's pitch value

Definition at line 167 of file wmavoice.c.

Referenced by synth_frame(), and wmavoice_decode_init().

◆ block_pitch_range

int WMAVoiceContext::block_pitch_range

range of the block pitch

Definition at line 169 of file wmavoice.c.

Referenced by synth_frame(), and wmavoice_decode_init().

◆ block_delta_pitch_nbits

int WMAVoiceContext::block_delta_pitch_nbits

number of bits used to specify the delta pitch between this and the last block's pitch value, used in all but first block

Definition at line 170 of file wmavoice.c.

Referenced by synth_frame(), and wmavoice_decode_init().

◆ block_delta_pitch_hrange

int WMAVoiceContext::block_delta_pitch_hrange

1/2 range of the delta (full range is from -this to +this-1)

Definition at line 174 of file wmavoice.c.

Referenced by synth_frame(), and wmavoice_decode_init().

◆ block_conv_table

uint16_t WMAVoiceContext::block_conv_table[4]

boundaries for block pitch unit/scale conversion

Definition at line 176 of file wmavoice.c.

Referenced by synth_frame(), and wmavoice_decode_init().

◆ spillover_nbits

int WMAVoiceContext::spillover_nbits

number of bits of the previous packet's last superframe preceding this packet's first full superframe (useful for re-synchronization also)

Definition at line 188 of file wmavoice.c.

Referenced by parse_packet_header(), and wmavoice_decode_packet().

◆ has_residual_lsps

int WMAVoiceContext::has_residual_lsps

if set, superframes contain one set of LSPs that cover all frames, encoded as independent and residual LSPs; if not set, each frame contains its own, fully independent, LSPs

Definition at line 192 of file wmavoice.c.

Referenced by parse_packet_header(), and synth_superframe().

◆ skip_bits_next

int WMAVoiceContext::skip_bits_next

number of bits to skip at the next call to wmavoice_decode_packet() (since they're part of the previous superframe)

Definition at line 197 of file wmavoice.c.

Referenced by wmavoice_decode_packet(), and wmavoice_flush().

◆ sframe_cache

uint8_t WMAVoiceContext::sframe_cache[SFRAME_CACHE_MAXSIZE+AV_INPUT_BUFFER_PADDING_SIZE]

cache for superframe data split over multiple packets

Definition at line 201 of file wmavoice.c.

Referenced by synth_superframe(), and wmavoice_decode_packet().

◆ sframe_cache_size

int WMAVoiceContext::sframe_cache_size

set to >0 if we have data from an (incomplete) superframe from a previous packet that spilled over in the current packet; specifies the amount of bits in sframe_cache

Definition at line 204 of file wmavoice.c.

Referenced by synth_superframe(), wmavoice_decode_packet(), and wmavoice_flush().

◆ pb

PutBitContext WMAVoiceContext::pb

bitstream writer for sframe_cache

Definition at line 209 of file wmavoice.c.

Referenced by wmavoice_decode_packet().

◆ prev_lsps

double WMAVoiceContext::prev_lsps[MAX_LSPS]

LSPs of the last frame of the previous superframe.

Definition at line 220 of file wmavoice.c.

Referenced by synth_superframe(), wmavoice_decode_init(), and wmavoice_flush().

◆ last_pitch_val

int WMAVoiceContext::last_pitch_val

pitch value of the previous frame

Definition at line 222 of file wmavoice.c.

Referenced by synth_block_fcb_acb(), synth_frame(), and wmavoice_decode_init().

◆ last_acb_type

int WMAVoiceContext::last_acb_type

frame type [0-2] of the previous frame

Definition at line 223 of file wmavoice.c.

Referenced by synth_frame(), and wmavoice_decode_init().

◆ pitch_diff_sh16

int WMAVoiceContext::pitch_diff_sh16

((cur_pitch_val - last_pitch_val) << 16) / MAX_FRAMESIZE

Definition at line 224 of file wmavoice.c.

Referenced by synth_block_fcb_acb(), and synth_frame().

◆ silence_gain

float WMAVoiceContext::silence_gain

set for use in blocks if ACB_TYPE_NONE

Definition at line 226 of file wmavoice.c.

Referenced by synth_block_fcb_acb(), synth_block_hardcoded(), and synth_frame().

◆ aw_idx_is_ext

int WMAVoiceContext::aw_idx_is_ext

whether the AW index was encoded in 8 bits (instead of 6)

Definition at line 228 of file wmavoice.c.

Referenced by aw_parse_coords(), and aw_pulse_set1().

◆ aw_pulse_range

int WMAVoiceContext::aw_pulse_range

the range over which aw_pulse_set1() can apply the pulse, relative to the value in aw_first_pulse_off.

The exact position of the first AW-pulse is within [pulse_off, pulse_off + this], and depends on bitstream values; [16 or 24]

Definition at line 230 of file wmavoice.c.

Referenced by aw_parse_coords(), aw_pulse_set1(), and aw_pulse_set2().

◆ aw_n_pulses

int WMAVoiceContext::aw_n_pulses[2]

number of AW-pulses in each block; note that this number can be negative (in which case it basically means "zero")

Definition at line 236 of file wmavoice.c.

Referenced by aw_parse_coords(), aw_pulse_set1(), and aw_pulse_set2().

◆ aw_first_pulse_off

int WMAVoiceContext::aw_first_pulse_off[2]

index of first sample to which to apply AW-pulses, or -0xff if unset

Definition at line 239 of file wmavoice.c.

Referenced by aw_parse_coords(), aw_pulse_set1(), and aw_pulse_set2().

◆ aw_next_pulse_off_cache

int WMAVoiceContext::aw_next_pulse_off_cache

the position (relative to start of the second block) at which pulses should start to be positioned, serves as a cache for pitch-adaptive window pulses between blocks

Definition at line 241 of file wmavoice.c.

Referenced by aw_pulse_set2().

◆ frame_cntr

int WMAVoiceContext::frame_cntr

current frame index [0 - 0xFFFE]; is only used for comfort noise in pRNG()

Definition at line 247 of file wmavoice.c.

Referenced by synth_block_fcb_acb(), synth_block_hardcoded(), and synth_frame().

◆ nb_superframes

int WMAVoiceContext::nb_superframes

number of superframes in current packet

Definition at line 249 of file wmavoice.c.

Referenced by wmavoice_decode_packet().

◆ gain_pred_err

float WMAVoiceContext::gain_pred_err[6]

cache for gain prediction

Definition at line 250 of file wmavoice.c.

Referenced by synth_block_fcb_acb(), synth_block_hardcoded(), and wmavoice_flush().

◆ excitation_history

float WMAVoiceContext::excitation_history[MAX_SIGNAL_HISTORY]

cache of the signal of previous superframes, used as a history for signal generation

Definition at line 251 of file wmavoice.c.

Referenced by synth_superframe(), and wmavoice_flush().

◆ synth_history

float WMAVoiceContext::synth_history[MAX_LSPS]

see excitation_history

Definition at line 255 of file wmavoice.c.

Referenced by synth_superframe(), and wmavoice_flush().

◆ rdft

RDFTContext WMAVoiceContext::rdft

Definition at line 265 of file wmavoice.c.

Referenced by calc_input_response(), wiener_denoise(), wmavoice_decode_end(), and wmavoice_decode_init().

◆ irdft

RDFTContext WMAVoiceContext::irdft

contexts for FFT-calculation in the postfilter (for denoise filter)

Definition at line 265 of file wmavoice.c.

Referenced by calc_input_response(), wiener_denoise(), wmavoice_decode_end(), and wmavoice_decode_init().

◆ dct

DCTContext WMAVoiceContext::dct

Definition at line 267 of file wmavoice.c.

Referenced by calc_input_response(), wmavoice_decode_end(), and wmavoice_decode_init().

◆ dst

DCTContext WMAVoiceContext::dst

contexts for phase shift (in Hilbert transform, part of postfilter)

Definition at line 267 of file wmavoice.c.

Referenced by calc_input_response(), wmavoice_decode_end(), and wmavoice_decode_init().

◆ sin

float WMAVoiceContext::sin[511]

Definition at line 269 of file wmavoice.c.

Referenced by calc_input_response(), and wmavoice_decode_init().

◆ cos

float WMAVoiceContext::cos[511]

8-bit cosine/sine windows over [-pi,pi] range

Definition at line 269 of file wmavoice.c.

Referenced by calc_input_response(), and wmavoice_decode_init().

◆ postfilter_agc

float WMAVoiceContext::postfilter_agc

gain control memory, used in adaptive_gain_control()

Definition at line 271 of file wmavoice.c.

Referenced by postfilter(), and wmavoice_flush().

◆ dcf_mem

float WMAVoiceContext::dcf_mem[2]

DC filter history.

Definition at line 273 of file wmavoice.c.

Referenced by postfilter(), and wmavoice_flush().

◆ zero_exc_pf

float WMAVoiceContext::zero_exc_pf[MAX_SIGNAL_HISTORY+MAX_SFRAMESIZE]

zero filter output (i.e.

excitation) by postfilter

Definition at line 274 of file wmavoice.c.

Referenced by synth_frame(), synth_superframe(), and wmavoice_flush().

◆ denoise_filter_cache

float WMAVoiceContext::denoise_filter_cache[MAX_FRAMESIZE]

Definition at line 277 of file wmavoice.c.

Referenced by wiener_denoise(), and wmavoice_flush().

◆ denoise_filter_cache_size

int WMAVoiceContext::denoise_filter_cache_size

samples in denoise_filter_cache

Definition at line 278 of file wmavoice.c.

Referenced by wiener_denoise().

◆ tilted_lpcs_pf

float WMAVoiceContext::tilted_lpcs_pf[0x80]

aligned buffer for LPC tilting

Definition at line 279 of file wmavoice.c.

Referenced by wiener_denoise().

◆ denoise_coeffs_pf

float WMAVoiceContext::denoise_coeffs_pf[0x80]

aligned buffer for denoise coefficients

Definition at line 281 of file wmavoice.c.

Referenced by wiener_denoise().

◆ synth_filter_out_buf

float WMAVoiceContext::synth_filter_out_buf[0x80+MAX_LSPS_ALIGN16]

aligned buffer for postfilter speech synthesis

Definition at line 283 of file wmavoice.c.

Referenced by postfilter(), and wmavoice_flush().

The documentation for this struct was generated from the following file:

libavcodec/wmavoice.c

Data Fields

Detailed Description

Field Documentation

◆ gb

◆ vbm_tree

◆ spillover_bitsize

◆ history_nsamples

◆ do_apf

◆ denoise_strength

◆ denoise_tilt_corr

◆ dc_level

◆ lsps

◆ lsp_q_mode

◆ lsp_def_mode

◆ min_pitch_val

◆ max_pitch_val

◆ pitch_nbits

◆ block_pitch_nbits

◆ block_pitch_range

◆ block_delta_pitch_nbits

◆ block_delta_pitch_hrange

◆ block_conv_table

◆ spillover_nbits

◆ has_residual_lsps

◆ skip_bits_next

◆ sframe_cache

◆ sframe_cache_size

◆ pb

◆ prev_lsps

◆ last_pitch_val

◆ last_acb_type

◆ pitch_diff_sh16

◆ silence_gain

◆ aw_idx_is_ext

◆ aw_pulse_range

◆ aw_n_pulses

◆ aw_first_pulse_off

◆ aw_next_pulse_off_cache

◆ frame_cntr

◆ nb_superframes

◆ gain_pred_err

◆ excitation_history

◆ synth_history

◆ rdft

◆ irdft

◆ dct

◆ dst

◆ sin

◆ cos

◆ postfilter_agc

◆ dcf_mem

◆ zero_exc_pf

◆ denoise_filter_cache

◆ denoise_filter_cache_size

◆ tilted_lpcs_pf

◆ denoise_coeffs_pf

◆ synth_filter_out_buf