Hi all,
I’m working with CITE-seq data where 4 samples were multiplexed, stained with antibodies in a single tube, and then distributed across 4 sequencing lanes. I’d like advice on the optimal analysis workflow.
Specifically:
- At which step should I split by lanes and then merge adata objects from the same biological sample?
- Is the following processing order reasonable?
- Ambient RNA removal (CellBender)
- Doublet removal
- QC
- Protein normalization (DSB)
- HVG selection
- totalVI integration (raw count)
- Since CellBender, DSB, and totalVI all perform some form of denoising, are there any conflicts between them?
- Should I use DSB normalization before totalVI, or instead normalize denoised_protein values from totalVI afterwards?
Any recommendations, best practices, or reference papers would be greatly appreciated. Thanks!