sklearn.utils.sparsefuncs
.incr_mean_variance_axis¶
-
sklearn.utils.sparsefuncs.
incr_mean_variance_axis
(X, axis, last_mean, last_var, last_n)[source]¶ Compute incremental mean and variance along an axix on a CSR or CSC matrix.
last_mean, last_var are the statistics computed at the last step by this function. Both must be initialized to 0-arrays of the proper size, i.e. the number of features in X. last_n is the number of samples encountered until now.
Parameters: - X : CSR or CSC sparse matrix, shape (n_samples, n_features)
Input data.
- axis : int (either 0 or 1)
Axis along which the axis should be computed.
- last_mean : float array with shape (n_features,)
Array of feature-wise means to update with the new data X.
- last_var : float array with shape (n_features,)
Array of feature-wise var to update with the new data X.
- last_n : int with shape (n_features,)
Number of samples seen so far, excluded X.
Returns: - means : float array with shape (n_features,)
Updated feature-wise means.
- variances : float array with shape (n_features,)
Updated feature-wise variances.
- n : int with shape (n_features,)
Updated number of seen samples.
Notes
NaNs are ignored in the algorithm.