Best pre-processing steps for sc.tl.ingest()?

JuliaChristiaanse · July 20, 2023, 3:20pm

Hi everyone,

I have two questions regarding the pre-processing for the ingest tool of scanpy.

I want to use sc.tl.ingest() to map celltype labels from a reference dataset to my “query” dataset. This will be done per cell barcode instead of per cluster like in the scanpy vignette (Integrating data using ingest and BBKNN — Scanpy documentation). Manual annotation per leiden or louvain algorithm cluster is not an option for this dataset, all annotations are cell-level based.

Should the pre-processing steps I use for my reference dataset be the same as the pbmc3k tutorial (Preprocessing and clustering 3k PBMCs — Scanpy documentation) refered to in the vignette?

My query dataset consists of two individually processed AnnData objects, pre-processed like the pbmc3k tutorial and then integrated into one AnnData object with sc.external.harmony(). Are there any additional pre-processing steps I need to do before I pass this query dataset to ingest()?

I am pretty new to data annotation so all suggestions are welcome
Thanks!

Topic		Replies	Views
Tl.ingest not working (`sca.datasets` MWE) scanpy	2	403	June 21, 2022
Mapping a small reference dataset to a large dataset using scanpy ingest Help	0	281	August 25, 2022
Scanpy Ingest Understanding scRNA-seq	0	805	July 8, 2022
Two dataset after ingest, can I compare gene expression? scanpy	0	295	March 1, 2023
Transferring lables from refrence dataset to query dataset with a more diverse cell population using scANVI scvi-tools scanvi	8	130	October 17, 2024

Best pre-processing steps for sc.tl.ingest()?

Related topics