Scanvi best practices

crg · July 1, 2021, 1:53pm

Hi,

I’ve been reviewing the latest changes in SCANVI. I noticed that scvi pretraining was removed from SCANVI itself and that the option to pretrain was moved to the from_scvi_model() method.

Could you elaborate on whether it is still best practice to pretrain with scvi, or whether just using scanvi on its own is better?

Thanks
Charlotte

adamgayoso · July 1, 2021, 2:58pm

It is still best practice to pretrain a SCVI model and then instantiate SCANVI with the from_scvi_model class method. We moved this around for API reasons.

crg · July 1, 2021, 3:37pm

Thanks for the quick response!

grst · September 15, 2021, 7:02am

Does it make a difference if I already add the labels_key when training the SCVI model, i.e.

scvi.data.setup_anndata(adata, batch_key="batch", labels_key="seed_labels")
scvi_model = scvi.model.SCVI(adata)
scvi_model.train()
scanvi_model = scvi.model.SCANVI.from_scvi_model(scvi_model, 'Unknown')
scanvi_model.train()

or if I add the labels only when running SCANVI?

scvi.data.setup_anndata(adata, batch_key="batch")
scvi_model = scvi.model.SCVI(adata)
scvi_model.train()
scvi.data.setup_anndata(adata, batch_key="batch", labels_key="seed_labels")
scanvi_model = scvi.model.SCANVI.from_scvi_model(scvi_model, "Unknown", adata=adata)
scanvi_model.train()

The former method is used in the “seed labelling” tutorial, the latter in the “atlas-level integration” tutorial.

adamgayoso · September 15, 2021, 3:14pm

@grst Either way will work. setup_anndata just creates a dictionary in adata.uns["_scvi"], and the labels won’t do anything to SCVI. This will be more explicit in a future release.

kanefos · November 28, 2022, 8:30pm

Hi,

I have a similar question. If I have a scvi model trained with one labels_key but later want to use it to create a scanvi model that predicts a different set of labels (a new labels_key), is this possible? Can I replace the original labels_key?

adamgayoso · November 28, 2022, 10:17pm

In this case, it’s better not to provide the labels key to scvi, as the only thing this enables is gene-label specific dispersion parameters.

If you do provide the labels_key to scvi, what you want will not be possible. However, if you only provide the labels at the time of scanvi initialization, it is possible.
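
As a rough sketch of that second route, the helper below reuses the calls shown earlier in this thread (scvi.data.setup_anndata and SCANVI.from_scvi_model with adata=...). The function name scanvi_from_pretrained, its default arguments, and the adata.copy() step are assumptions added for illustration, not part of scvi-tools, and newer releases may expose setup_anndata differently, so check the documentation for your installed version.

```python
def scanvi_from_pretrained(scvi_model, adata, labels_key,
                           batch_key="batch", unlabeled_category="Unknown"):
    """Hypothetical helper: build a SCANVI model for `labels_key` from an
    SCVI model that was trained WITHOUT any labels_key."""
    # Imported here so the helper can be defined without scvi-tools installed.
    import scvi

    # Assumption: work on a copy so the registration the SCVI model was
    # trained on is not overwritten by the new setup call.
    adata_labeled = adata.copy()

    # Register the labels only now, as in the second snippet earlier in the thread.
    scvi.data.setup_anndata(adata_labeled, batch_key=batch_key, labels_key=labels_key)

    # Initialise SCANVI from the pretrained SCVI model, then train it.
    scanvi_model = scvi.model.SCANVI.from_scvi_model(
        scvi_model, unlabeled_category, adata=adata_labeled
    )
    scanvi_model.train()
    return scanvi_model
```

Under these assumptions, the same pretrained scvi_model could be passed to the helper once per labels_key you want to predict, for example scanvi_from_pretrained(scvi_model, adata, "seed_labels").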
