scVI dropout, need for it?

If anyone on the team working on scVI could help with this query, that would be amazing.

I am just checking whether my interpretation of the base code is right: does dropout occur in both the encoder and decoder layers?

If that is the case, I wanted to check whether any internal validation work has been done on how performance depends on the dropout value?

My vague understanding of VAEs (which isn’t particularly good) is that dropout isn’t necessarily a good thing, as it may affect the latent representation.

From a small trial on my data, the UMAP embeddings look “better” when it is set to zero. I know this isn’t a good way of judging a model, but there aren’t really any consensus metrics, and I find that autotuned embeddings (while optimised for the lowest loss value) often result in overly smoothed UMAP plots where it is incredibly hard to distinguish cell types from one another.


I believe we only have dropout by default in the encoder (see the encoder code here and decoder here).

We include dropout in scVI primarily for regularization; in other words, it helps the model avoid overfitting the training dataset. This depends on the data, of course, but generally speaking removing dropout decreases the training loss while increasing the validation loss and/or producing worse embeddings on the validation set.
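To make the regularization mechanics concrete, here is a minimal, self-contained sketch of inverted dropout (the standard formulation used by deep learning frameworks, not code taken from scVI itself): during training each unit is zeroed with probability p and the survivors are rescaled by 1/(1-p) so the expected activation is unchanged, while at inference time the layer is the identity.

```python
import numpy as np

def dropout(x, p, rng, training=True):
    """Inverted dropout: zero each unit with probability p during
    training, rescale survivors by 1/(1-p) to preserve the expected
    activation, and act as the identity at inference time."""
    if not training or p == 0.0:
        return x
    mask = rng.random(x.shape) >= p  # keep each unit with prob 1 - p
    return x * mask / (1.0 - p)

rng = np.random.default_rng(0)
x = np.ones(10_000)

train_out = dropout(x, p=0.1, rng=rng, training=True)
eval_out = dropout(x, p=0.1, rng=rng, training=False)

# Roughly 10% of units are zeroed in training mode, yet the mean
# activation stays near 1.0 because of the 1/(1-p) rescaling.
print((train_out == 0).mean(), train_out.mean())
# Inference is deterministic: the input passes through unchanged.
print(eval_out.mean())  # 1.0
```

The random masking is what injects noise into the encoder during training; setting `dropout_rate=0` removes that noise source entirely, which is why it can change both the fit and the look of the resulting embeddings.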

If you see that removing dropout leads to better visualization of the data, then go for it; I don’t see anything wrong with that. I would just be careful that the model still performs adequately on the validation set.
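One way to run that check is to train the model at both dropout settings and compare the validation ELBO. The sketch below assumes scvi-tools is installed and uses its built-in synthetic dataset so it is self-contained; `dropout_rate`, `check_val_every_n_epoch`, and the `"elbo_validation"` history key reflect recent scvi-tools versions, so adjust for your install and substitute your own AnnData object in practice.

```python
import scvi

# Built-in synthetic AnnData, just so this sketch runs end to end;
# replace with your own dataset in real use.
adata = scvi.data.synthetic_iid()
scvi.model.SCVI.setup_anndata(adata)

for rate in (0.1, 0.0):  # library default vs. no dropout
    model = scvi.model.SCVI(adata, dropout_rate=rate)
    # Record validation metrics every epoch so the history is populated.
    model.train(max_epochs=20, check_val_every_n_epoch=1)
    val_elbo = model.history["elbo_validation"].values[-1]
    print(f"dropout_rate={rate}: final validation ELBO {val_elbo}")
```

If the `dropout_rate=0.0` run ends with a noticeably worse (higher) validation ELBO, the prettier UMAP is likely coming at the cost of overfitting.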

Thanks Martin, you were right: the validation loss was worse. This was helpful.
