Nonsense UMAP when including categorical covariates in a MULTIVI model

bsierieb1 · August 26, 2023, 6:10pm

Hi,

I am working with 10X multiome data. I first built an scvi and a peakvi model on the RNA and ATAC portion separately to get a general sense of each modality. Both worked fine, but I also discovered that it was absolutely essential to include total read counts per cell as a covariate in the peakvi model, otherwise the UMAP was just a single thick curve with cells ordered by sequencing depth. However, when I tried to use both modalities at the same time and when I included total ATAC read counts per cell (and/or other, RNA-based QC metrics) as covariate(s) in the multivi model, the resulting UMAP was just a homogeneous round cloud. Not including any categorical covariates works, but I see pronounced sequencing depth gradients, which makes me think that I do need to include them as covariates. Could you please advise if there is something wrong with my strategy or whether I am not specifying covariates correctly?

Thanks!

scvi.model.MULTIVI.setup_anndata(adata,
                                 layer = 'counts',
                                 continuous_covariate_keys = [ 'total_ATAC_counts'])

my_model = scvi.model.MULTIVI(adata,
                              n_genes = (adata.var["modality"] == "Gene Expression").sum(),
                              n_regions = (adata.var["modality"] == "Peaks").sum())

my_model.train()

adata.obsm["X_multivi"] = my_model.get_latent_representation()

sc.pp.neighbors(adata,
                use_rep = "X_multivi")

sc.tl.umap(adata,
           spread = 2)

sc.pl.umap(adata,
           color='total_ATAC_counts',
           color_map='plasma_r',
           size=2)

martinkim0 · August 28, 2023, 9:02pm

Hi, would you be able to include additional details such as scVI, PeakVI, and MultiVI UMAPs, as well as possibly plots for training and validation loss?

Topic		Replies	Views
Differential expression and differential accessibility with MultiVi scvi-tools	3	715	January 19, 2022
Protocol for model optimization (currently focused on MultiVI) scvi-tools multivi	3	552	May 14, 2025
multiVI and totalVI modal integration question scvi-tools scvi , multivi , totalvi	0	474	September 15, 2022
Integration of Multiple Multiome Datasets Multiome integration , multivi , totalvi	5	704	March 6, 2024
Weird UMAP after running scVI scvi-tools	6	282	September 12, 2024

Nonsense UMAP when including categorical covariates in a MULTIVI model

Related topics