scvi.data.organize_multiome_anndatas with two big anndata objects

niuyw · May 1, 2024, 9:45am

Hi,

I am trying to use MultiVI with two different anndata objects for paired scRNA and scATAC. For the organize_multiome_anndatas step, I was following the tutorial from the single-cell best practices book:

adata_paired = ad.concat([rna.copy().T, atac.copy().T]).T
adata_paired.obs = adata_paired.obs.join(rna.obs[["cell_type", "batch"]])
adata_paired.obs["modality"] = "paired"
adata_paired

adata_mvi = scvi.data.organize_multiome_anndatas(adata_paired)

But the ad.concat([rna.copy().T, atac.copy().T]).T step requires a huge amount of memory, and my jobs are always killed by the system. Is there a way to bypass this step?

Thanks in advance!

cane11 · May 3, 2024, 8:46pm

Hi, the first thing to try would be:

adata = ad.concat([rna, atac], axis=1)

There is also a function to do the merging on disk: anndata.experimental.concat_on_disk. It also makes sense to subset to highly variable genes beforehand.
