Handling Non-Matching Categorical Labels in scArches

yojetsharma · September 13, 2024, 8:46pm

I’m using scArches to compare gene expression profiles between disease and normal samples. My query dataset contains cells from stem cells of healthy individuals and patients, labeled as WT, LSP2, and LSP3. The reference dataset includes samples labeled Sample-1, Sample-2, and Sample-3, derived from healthy brains.

The issue is that the categorical labels in my query and reference datasets do not directly match, and they represent fundamentally different conditions.

How should I handle and map these non-matching categorical labels to ensure compatibility for analysis in scArches?

Specifically:

Should I create a mapping that reflects the biological context of each sample?
Are there best practices for aligning categories when they represent different conditions or experimental setups?

Topic		Replies	Views
scArches with multiple covariates scvi-tools integration , scvi , scarches	9	766	September 22, 2024
Add new data to existing integration scvi-tools	3	64	March 7, 2025
Using a model with categorical_covariate_key instead of batch_key scvi-tools	2	542	February 1, 2024
Don't extend labels for query data scvi-tools scanvi	1	302	March 6, 2023
Is it possible to add extra covariate categories to a trained SCVI model? scvi-tools scrna-seq , integration , scanvi , scvi	3	85	October 4, 2024

Handling Non-Matching Categorical Labels in scArches

Related topics