grassp.pp.filter_proteins

grassp.pp.filter_proteins#

filter_proteins(data, *, min_counts=None, min_samples=None, max_counts=None, max_samples=None, inplace=True, copy=False)[source]#

Filter proteins based on number of counts or samples.

Parameters:
data AnnData | spmatrix | ndarray | Array

The annotated data matrix of shape n_obs x n_vars. Rows correspond to proteins and columns to samples.

min_counts Optional[int] (default: None)

Minimum number of counts required for a protein to pass filtering.

min_samples Optional[int] (default: None)

Minimum number of samples expressed required for a protein to pass filtering.

max_counts Optional[int] (default: None)

Maximum number of counts required for a protein to pass filtering.

max_samples Optional[int] (default: None)

Maximum number of samples expressed required for a protein to pass filtering.

inplace bool (default: True)

Perform computation inplace or return result.

copy bool (default: False)

If an AnnData is passed, determines whether a copy is returned.

Return type:

AnnData | tuple[ndarray, ndarray] | None

Returns:

Depending on inplace and input type, returns either: - None if inplace=True - AnnData if input is AnnData and inplace=False - A tuple of arrays (retained_proteins, retained_samples) if input is not AnnData