grassp.pp.aggregate_proteins#
- aggregate_proteins(data, grouping_columns, agg_func=<function median>)[source]#
 Aggregates protein intensities across samples using a given function.
- Parameters:
 - data 
AnnData The annotated data matrix with proteins as observations (rows).
- grouping_columns 
Union[str,List[str]] Column name(s) in
data.obsto group samples into replicates.- agg_func 
Callable[[ndarray,Optional[int]],ndarray] (default:<function median at 0x109741230>) Function to use for aggregation. Defaults to
np.median.
- data 
 - Return type:
 - Returns:
 A new
AnnDataobject with aggregated expression values. The number of variables (samples) remains the same, but the number of observations (proteins) will correspond to the number of unique groups defined bygrouping_columns.
Notes
This function is useful for e.g. combining multiple proteins that belong to the same gene. For each protein, it groups the samples based on the provided
grouping_columnsand then aggregates the intensity values using the specifiedagg_func.