Earlier docs for scVI mention that the number of epochs should scale down with the number of obs/cells, e.g.: “([For] > 100,000 cells) you may only need ~10 epochs.” I cannot find similar advice in the current docs. Is this advice still valid, and should even fewer epochs be used for even larger datasets?
I don’t think it’s relevant anymore, and it really depends on what you are doing.
You will need to train until convergence, and you can use the `early_stopping` parameter for that. But if you are starting from an already-trained large model and only fine-tuning it with your cells, then yes, a few epochs may well be enough.
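For intuition, here is a minimal sketch of the patience-based logic that early-stopping mechanisms like this generally rely on — this is not scvi-tools' internal implementation, just the idea: stop once the monitored validation metric has not improved for a fixed number of epochs.

```python
# Minimal sketch of patience-based early stopping. NOT scvi-tools'
# actual implementation -- just the general mechanism that an
# `early_stopping` option is built on.

def stopping_epoch(val_losses, patience=3, min_delta=0.0):
    """Return the epoch index at which training would stop.

    val_losses: per-epoch validation losses (precomputed here for clarity).
    patience:   epochs to wait without improvement before stopping.
    min_delta:  minimum decrease that counts as an improvement.
    """
    best = float("inf")
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best - min_delta:
            best = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch  # stop here
    return len(val_losses) - 1  # never triggered: ran all epochs

# A loss that plateaus after epoch 3 triggers the stop at epoch 6
# (three consecutive epochs with no improvement):
losses = [1.0, 0.8, 0.6, 0.5, 0.5, 0.5, 0.5, 0.5]
print(stopping_epoch(losses, patience=3))  # -> 6
```

The key consequence: if the monitored metric keeps decreasing, even slightly more than `min_delta` per epoch, the counter keeps resetting and training runs to the epoch limit.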
Thank you. If I am training a completely new model, what could be unintended consequences of not using the ‘early_stopping’ parameter to ensure convergence?
Also, for models trained on the 30 million cells of the CELLxGENE census, going below 10 epochs was not good for results. We saw little improvement beyond 10 epochs, but decided to train for 20 epochs anyway.
Thank you. Is it normal for it to keep improving for hundreds of epochs? I used `early_stopping=True`, but it still ran all 400 epochs. The loss curves look quite normal to me:
However, the reconstruction loss doesn’t tell the whole story, as it is only one part of the ELBO (the other part being the KL divergence). The overall objective, the negative ELBO, is probably still decreasing, and that is why, despite you using `early_stopping`, it ran the whole 400 epochs.
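A toy numerical example of the point above: the reconstruction loss can plateau while the KL term keeps shrinking, so the full objective still improves every epoch and an ELBO-based stopping criterion never fires. All values below are made up for illustration.

```python
# Illustrative (made-up) per-epoch values: reconstruction loss flattens
# out, but the KL divergence keeps decreasing, so the negative ELBO
# (reconstruction loss + KL) is still strictly improving.

recon = [120.0, 100.0, 95.0, 94.0, 94.0, 94.0]  # plateaus after a few epochs
kl    = [ 50.0,  40.0, 32.0, 26.0, 21.0, 17.0]  # still shrinking

neg_elbo = [r + k for r, k in zip(recon, kl)]
print(neg_elbo)  # -> [170.0, 140.0, 127.0, 120.0, 115.0, 111.0]

# Reconstruction loss alone suggests convergence...
assert recon[-1] == recon[-2]
# ...but the monitored objective is still strictly decreasing, so a
# stopping criterion watching the ELBO keeps training going.
assert all(a > b for a, b in zip(neg_elbo, neg_elbo[1:]))
```

This is why a flat-looking reconstruction curve alone is not evidence that early stopping should have triggered.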