Motivation for `max_epochs = np.min([round((20000 / n_cells) * 400), 400])`?

In reference to the default `train()` implementation, consider the cases where `adata` contains:

  1. 40k cells → max_epochs=200
  2. 500 cells → max_epochs=400

My concern is that the risk of over-fitting would likely increase with fewer cells, yet the heuristic trains smaller datasets for more epochs.
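
For concreteness, here is the heuristic written out as a small function (just a restatement of the formula in the title, not scvi-tools' actual code):

```python
import numpy as np

def default_max_epochs(n_cells: int) -> int:
    """Default max_epochs heuristic: cap at 400 epochs and scale down
    inversely with the number of cells once the dataset exceeds 20k cells."""
    return int(np.min([round((20000 / n_cells) * 400), 400]))

print(default_max_epochs(40_000))  # 200
print(default_max_epochs(500))     # 400
```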

While the above behavior may follow Lopez et al. 2018, in which they note that “bigger datasets require fewer epochs”, it isn’t clear by what metric this was determined: in particular, whether they just used the training ELBO or whether they diagnosed [lack of] overfitting using, e.g., the marginal likelihood of a held-out test set.
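
For what it’s worth, one could in principle check this with scvi-tools’ own marginal-likelihood estimator; a rough sketch (assuming `model` is a fitted `scvi.model.SCVI` and `adata` is the AnnData it was set up with):

```python
import numpy as np

# Rough sketch of an overfitting check: compare marginal log-likelihood
# estimates on training cells vs. held-out cells. For a fair check the
# held-out cells should be excluded from training, e.g. by splitting the
# indices before calling model.train().
rng = np.random.default_rng(0)
perm = rng.permutation(adata.n_obs)
test_idx, train_idx = perm[: adata.n_obs // 10], perm[adata.n_obs // 10 :]

# Importance-sampling estimates of the marginal log-likelihood per subset.
train_ll = model.get_marginal_ll(adata, indices=train_idx, n_mc_samples=100)
test_ll = model.get_marginal_ll(adata, indices=test_idx, n_mc_samples=100)
print(f"marginal LL - train: {train_ll:.2f}, held-out: {test_ll:.2f}")
```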

Oh, I just realized scvi.train.Trainer has early stopping!
I guess the following should suffice:

model.train(early_stopping=True, early_stopping_monitor='reconstruction_loss_validation')
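
For completeness, a slightly fuller sketch; the patience and validation-split arguments are my assumptions about what gets forwarded through `train()` to `scvi.train.Trainer`:

```python
# Hedged sketch: stop when the held-out reconstruction loss stops improving.
# train_size / validation_size control scvi-tools' internal split; the
# early_stopping_* kwargs are assumed to be forwarded to scvi.train.Trainer.
model.train(
    max_epochs=400,                # upper bound; early stopping decides the rest
    train_size=0.9,
    validation_size=0.1,
    early_stopping=True,
    early_stopping_monitor="reconstruction_loss_validation",
    early_stopping_patience=45,    # epochs with no improvement before stopping
    early_stopping_min_delta=0.0,
    check_val_every_n_epoch=1,     # ensure the monitored metric is logged every epoch
)
```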

The heuristic is really just that: a heuristic. It allows scVI to give you something reasonable in less than an hour for a large number of cells.

Indeed, we also have early stopping. One thing to consider is that for larger datasets an epoch takes longer, so the early-stopping parameters would have to be adjusted. You can add the parameter limit_train_batches to the train method; it gets passed through to PyTorch Lightning. It seems reasonable to me to limit the train batches such that each epoch covers around 50k to 100k cells, as in the sketch below.
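
For example (a sketch assuming the default minibatch size of 128; `limit_train_batches` is forwarded to the Lightning Trainer as noted above):

```python
# Cap the number of minibatches per epoch so each "epoch" sees roughly
# 100k cells regardless of dataset size, then let early stopping decide
# how many of those shortened epochs to run.
batch_size = 128                           # scvi-tools' default minibatch size
cells_per_epoch = 100_000                  # target ~50k-100k cells per epoch
n_batches = max(1, cells_per_epoch // batch_size)

model.train(
    max_epochs=400,
    batch_size=batch_size,
    early_stopping=True,
    early_stopping_monitor="reconstruction_loss_validation",
    limit_train_batches=n_batches,         # passed through to PyTorch Lightning
)
```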

Thanks, that makes sense!