How to interpret reconstruction loss increase but elbo loss decrease on trained scvi and scanvi models

sckinta · May 9, 2025, 2:57pm

Hi, I am trying to use lung cancer single cell atlas (https://luca.icbi.at/) to annotate a Xenium experiment. Given Xenium is gene-panel based (~400 genes), I retrained atlas by subsetting features to 400, first training scvi model to integrate, then using semi-supervised scanvi model with cell type as label key. For both model, I trained 300 epochs. However, when examining the model metrics, I noticed elbo loss decrease but reconstruction loss increase. Here are the plots

I am wondering:

Is my planned workflow (subset features - reference scvi training- reference scanvi training - query-reference mapping) valid?
how do we interpret “reconstruction loss increase”? I am a little suspicious that the features from Xenium gene panel are not good to represent cell type. I am wondering whether this can be the reason.
With this quality of training, can i use the model to do reference mapping for Xenium data?

Thank you!!!

ori-kron-wis · May 11, 2025, 9:19am

yes
the problem where reconstruction loss increase and KL loss keep decreasing (or vice versa), is something known for cVAE’s.. and is a result of optimal parameters being optimzie “better” in the direction of one of those losses only (i.e other directions). however, the global ELBO is what you really need to look for and it does optimize to a final plateau.

Having said that, you can decide to stop the training after certain number of epochs and it may depends on your downstream task whether you prefer low reconstruction loss (a good generative model) or low KL local (likely better integration). In your case, increase in reconstruction loss means the model relays less on encoder latent variables and more on the decoder mapping (a sort of overfit).
I would suggest to play of some of the training plan parameter: lower learning rate, increase kl-warmup steps and lower max_kl_weight. Cant say apriori what will work.

I think that the figures can give us idea whether we trained enough and to the optimal point and did not overfit, but as to whether to say we can use to do reference mapping that will work then, I would just check the classification quality of the scanvi reference model validation set to get an idea.

Topic		Replies	Views
Interpreting model history curves scvi-tools peakvi	1	460	March 23, 2024
How to compare different parameter sets using the validation loss? Help integration , scvi	6	755	August 26, 2024
Suggestion on parameters for training scvi model scvi-tools integration , scvi	3	1698	December 4, 2023
Fine-Tuning scVI Model Help scvi	2	100	October 23, 2024
Assessing scVI fit by gene scvi-tools scvi , model-fit	3	514	June 23, 2021

How to interpret reconstruction loss increase but elbo loss decrease on trained scvi and scanvi models

Related topics