Preprocessing: pp

Preprocessing: pp#

Any transformation of the data matrix that is not a tool. Other than tools, preprocessing steps usually don’t return an easily interpretable annotation, but perform a basic transformation on the data matrix.

Basic Preprocessing#

pp.calculate_qc_metrics

Calculate quality control metrics.

pp.filter_samples

Filter samples based on number of counts or proteins.

pp.filter_proteins

Filter proteins based on number of counts or samples.

pp.highly_variable_proteins

Identify highly variable proteins.

pp.filter_proteins_per_replicate

Filter proteins based on detection in replicates.

pp.aggregate_proteins

Aggregate proteins based on grouping columns.

pp.aggregate_samples

Aggregate samples based on grouping columns.

pp.normalize_total

Normalize expression values for each sample to sum to a constant value.

pp.drop_excess_MQ_metadata

Drop excess metadata columns from MaxQuant output.

pp.remove_contaminants

Remove contaminant proteins from the data matrix.

Imputation#

pp.impute_gaussian

Impute missing values using a Gaussian distribution.