Loading data into adata.obsm['airr'] of an existing scRNAseq object

JenC9292 · July 17, 2023, 7:59am

Hi!
I’m trying to load my TCR and BCR data into a pre-existing AnnData object containing scRNAseq gene expression. From the new update it appears that you can do this without needing to use MuData, is this correct and how do you load it in?

grst · July 23, 2023, 1:53pm

Thanks for your question!

First of all, are there any specific reasons not to use MuData? This is the recommended way of working with multimodal data in scverse, and I don’t think there are any downsides.

That said, if you want everything in a single AnnData object, you do the following:

load each modality into a separate AnnData object, e.g.

adata_gex = sc.read_10x_h5(...)
adata_tcr = ir.io.read_10x_vdj(...)
adata_bcr = ir.io.read_10x_vdj(...)

merge TCR and BCR into a single object

adata_airr = ir.pp.merge_airr(adata_tcr, adata_bcr)

Add AIRR data to GEX object, as described in the docs. This discards all cells from the AIRR object that do not have gene expression data:

# Map each cell barcode to its respective numeric index (assumes obs_names are unique)
barcode2idx = {barcode: i for i, barcode in enumerate(adata_airr.obs_names)}
# Generate a slice for the awkward array that retrieves the corresponding row
# from `adata_airr` for each barcode in `adata_gex`. `-1` will generate all
# "None"s for barcodes that are not in `adata_airr`
idx = [barcode2idx.get(barcode, -1) for barcode in adata_gex.obs_names]
adata_gex.obsm["airr"] = adata_airr.obsm["airr"][idx]

JenC9292 · July 24, 2023, 1:31am

Thanks for your reply! I was having issues with MuData as it expected VDJ data to be in a slot called “airr”, but your response on github showing how to specify airr_mod='tcr' when using both BCR and TCR data has resolved this. Thank you for highlighting the section of the docs though, I completely missed that bit.

Topic		Replies	Views
Problem with airr data loading scirpy	11	709	October 3, 2023
Integrating TRUST4 outputs into anndata GEX scirpy	3	49	January 22, 2025
CITEsq loading in RNA and ADT data scanpy	4	873	July 6, 2023
Not able to load TCR data scirpy	9	610	September 26, 2022
Bar plot using muon between scRNA-seq and scTCR-seq&scBCR-seq? scirpy integration	4	361	July 6, 2023

Loading data into adata.obsm['airr'] of an existing scRNAseq object

Related topics