Label transfer from CITE-seq

FelixTheStudent · September 6, 2022, 12:46pm

Hi all,

I’m looking for a way to transfer labels from a CITE-seq reference to a CITE-seq query. Initially I thought this should not be much different than the usual workflow with scvi or scArches. I thought I’d simply replace scVI with totalVI in one of these tutorials and am done.

But then I realized that the from_scvi_model function does not have a corresponding function for totalVI, so I’m stuck.

Is there a simple solution that I have missed? If not, why is it harder to transfer labels for CITE-seq?

Thanks for the great work!

Best,

Felix

adamgayoso · September 11, 2022, 10:54pm

So this is correct that as of now you cannot train a classifier on top of totalVI in a way that also affects the encoder (as in a model analogous to scANVI for totalVI).

In this tutorial I train a RF classifier on top of the latent space and show how to store it as an attribute of the model class (so it saves/loads).

I also want to add that this is a requested feature so we can look into it more as an enhancement this fall.

github.com/scverse/scvi-tools

Labels Key in totalVI

opened 03:27PM - 01 Jun 22 UTC

StefanMDPhD

enhancement

I've written this a few times in the past, but I can't say this often enough: Th…ank you for sharing your excellent analysis software with the community! I'm writing to inquire about the possibility of exposing the labels_key in the totalVI model. My reason for coveting this feature is the following: I have CITE-seq data from cells for which I have some a priori information on coarse cell type, i.e. my libraries consist of two different cell types that I've sorted (via FACS) and labeled with hash tag olives before multiplexing them into a single library. [Perhaps anecdotally] I notice that when I integrate several of these libraries via scVI, using batch and label keys, the boundary between the two cell types remain crisp. When I try this with totalVI, using batch key [and the additional CITE-seq data] there is more mixing between the [closely related] cell types. Thanks in advance for considering this request.

FelixTheStudent · September 12, 2022, 8:00am

Thanks, this is great!

For my understanding: If the encoder does not get retrained, what happens in your tutorial’s section Query model training? I imagine the network weights learned on the reference are simply applied to the raw counts of the query to compute the latent space. Or is something more complex going on, such as (re)training the decoder?

Thanks again!
Best,

Felix

adamgayoso · September 12, 2022, 2:34pm

What is happening is that the encoder sees the batch categories and is fine-tuning weights for the new query batch categories.

This is described in the scArches paper:

FelixTheStudent · September 15, 2022, 1:45pm

The tutorial you shared does not load scArches, and also does not call any model from it. Are you saying it still somehow use scArches in its section ‘Query model training’ (linked in my last post)? I feel this is a crucial point for me to understand how the scvi-tools and scArches modules work together.

Thanks!

Felix

adamgayoso · September 16, 2022, 7:12pm

load_query_data is precisely doing scArches architecture surgery. Any scvi-tools model that is being used from the scarches codebase is just calling the code in scvi-tools, so there is really no difference between the packages except the few models that are unique to scarches codebase.

Topic		Replies	Views
Vae_q.latent_space_classifer_ not found in totalVI CITE-seq reference mapping tutorial scvi-tools totalvi	3	208	November 20, 2023
Label transfer with SCVI-SCANVI pipeline changes (predicts wrong) labels in ref data scvi-tools scanvi , scvi	8	1017	July 31, 2023
Cannot transfer setup without `extend_categories = True` scvi-tools	2	1148	December 15, 2022
Add new data to existing integration scvi-tools	3	65	March 7, 2025
Error when trying to use scvi.model.SCANVI.from_scvi_model scvi-tools	7	225	July 22, 2025

Label transfer from CITE-seq

Related topics