Converting view of a backed anndata object into non-view/non-backed

deto · November 12, 2023, 8:08pm

What is the preferred way to load only a subset of cells into an anndata in memory? The workflow I’m thinking of is:

Load anndata object as backed
Based on .obs table, select a subset of genes
Load only that subset into an in-memory anndata

If I subset the backed anndata and then call .copy, because it is backed, the .copy function needs a file to save the results to. However this is unnecessary in my case - I just want to load the subset into memory.

I could do something like:

ad_subset_view = ad[indices]
ad_subset = anndata.AnnData(ad_subset_view.X, obs=ad_subset_view.obs, ....)

But I’m wondering if there is a more canonical way to do this? And I have the same question for mudata objects.

deto · November 13, 2023, 6:10pm

Update:

I found the AnnData.to_memory() function which appears to do this. Could be good to include this in the tutorials when talking about worked with backed anndata.

I don’t see any equivalent in mudata, however, but leveraging the to_memory() in anndata made it easy to write a custom function for this.

ergonyc · January 16, 2024, 4:36am

Thanks for the update. I’m trying to make a “lazy” AnnData loader from backed files. I think this will be useful.

Topic		Replies	Views
Memory Usage in multiple New Formats anndata	0	27	April 13, 2025
Subset on adata.uns["X"].var - help please! anndata	1	307	May 8, 2024
How to make and save the subset of a specific cluster of the data? anndata	2	441	June 16, 2022
[AnnData] Lazily create .obsm on disk anndata	4	480	May 10, 2022
Subsetting anndata using genelist anndata	4	4093	May 5, 2024

Converting view of a backed anndata object into non-view/non-backed

Related topics