Understanding batch-corrected counts in scVI

noraneko · March 9, 2025, 12:39am

@cane11, thanks again for your suggestions!

I’m still exploring the criticism module; the tutorial seems a bit outdated, and I’m not sure which metrics are the most informative for my case. Proper DE analysis (e.g., pseudobulked DESeq2) isn’t feasible for the small clusters, but comparing the top markers per cluster and per dataset before and after correction looks good. As expected, larger datasets show higher agreement, likely because they are less noisy.

I realized that my original concern about the validity of analyses involving batch-corrected counts can also be raised for the DE analysis performed by scVI itself. From other forum discussions (e.g., here), I learned that batch-corrected counts can be used for DE. Additionally, the documentation describes a Scenario 2 in which we can specify identical sets of batches for group1 and group2.

If I compare young vs. aged samples across multiple datasets, would it be theoretically valid to set:
batchid1 = batchid2 = ['aged1_dataset1', 'young1_dataset1', 'aged1_dataset2', 'young1_dataset2']
That is, the list would include representative samples from both cohorts across different datasets. If I understand correctly, the sampled counts would then be conditioned on the average over the specified samples.
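To make the setup concrete, here is a minimal sketch of how I would assemble the arguments. It assumes the sample IDs above are the categories of the `batch_key` registered in `setup_anndata`, and that the cohort label lives in an `obs` column I am calling `age_group` (a hypothetical name). The helper `build_de_kwargs` is my own, not part of scvi-tools; only the keyword names are taken from `differential_expression`.

```python
def build_de_kwargs(groupby, group1, group2, shared_batches):
    """Assemble keyword arguments for a batch-corrected scVI DE call.

    The same batch list is passed for both groups, so both sides of the
    comparison are decoded under an identical set of batches.
    """
    batches = list(shared_batches)
    return {
        "groupby": groupby,        # obs column holding the cohort label (hypothetical name below)
        "group1": group1,
        "group2": group2,
        "batch_correction": True,  # required for batchid1/batchid2 to take effect
        "batchid1": batches,
        "batchid2": list(batches),
    }


shared = ["aged1_dataset1", "young1_dataset1", "aged1_dataset2", "young1_dataset2"]
de_kwargs = build_de_kwargs("age_group", "aged", "young", shared)

# With a trained model available as `model` (assumed, not created here):
# de_results = model.differential_expression(**de_kwargs)
```

The call itself is left commented out because it needs a trained model; the point is only that `batchid1` and `batchid2` receive the same list.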

Would this be a valid approach?
Would it also be valid to include cells from the young and aged samples of smaller datasets in the comparison, even though those samples are too small and unrepresentative to be included in the batch list?
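The intuition behind both questions can be written as a toy calculation. This is not scVI's decoder: it assumes a purely additive batch offset on the log scale and made-up numbers, whereas the real decoder is a nonlinear network. Under that assumption, averaging the decoded expression over the same batch list for both groups cancels the offsets, and a cell's own batch never enters the calculation.

```python
import math


def mean_decoded(signal, batch_offsets, batch_list):
    """Average exp(signal + offset) over the batches in batch_list (toy decoder)."""
    total = sum(math.exp(signal + batch_offsets[b]) for b in batch_list)
    return total / len(batch_list)


# Made-up batch offsets on the log scale (assumption for illustration only).
batch_offsets = {
    "aged1_dataset1": 0.4,
    "young1_dataset1": -0.2,
    "aged1_dataset2": 1.1,
    "young1_dataset2": 0.3,
}
shared = list(batch_offsets)

# Batch-free log expression of one gene in each cohort (made up).
aged_signal, young_signal = 1.5, 1.0

# Same batch list on both sides: the offsets cancel in the ratio.
lfc_shared = math.log(
    mean_decoded(aged_signal, batch_offsets, shared)
    / mean_decoded(young_signal, batch_offsets, shared)
)

# Different batch lists per group: the offsets leak into the fold change.
aged_only = ["aged1_dataset1", "aged1_dataset2"]
young_only = ["young1_dataset1", "young1_dataset2"]
lfc_mismatch = math.log(
    mean_decoded(aged_signal, batch_offsets, aged_only)
    / mean_decoded(young_signal, batch_offsets, young_only)
)
```

In this toy, `lfc_shared` recovers the true difference of 0.5 between the two signals, while `lfc_mismatch` does not. Whether the trained model behaves closely enough to this picture, especially for cells from batches outside the list, is exactly what I am unsure about.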

Would appreciate your thoughts on this!
