Remove outliers from trained scVI model

joseph-siefert · March 16, 2024, 3:23am

Thanks for the great framework! On occasion when I build a model for a particular cell type, the majority of the cells cluster together while some are severe outliers and clearly not of the same cell type. I can easily remove these cells, however then they are absent from the trained model, so I cannot load the model with that dataset or perform differential expression. Is there a simple way to update the trained model with the outliers excluded, without having to train a new model?

Valentine_Svensson · March 19, 2024, 10:12pm

You should be able to use the model in any way regardless if you have removed some cells from the dataset.

What kind of error are you seeing?

joseph-siefert · March 20, 2024, 12:58am

I could still use it, but the cells, which are clearly outliers, are still included and will affect analysis performed with the model. For example, if I recompute neighbors from the latent space the projections only get worse. The model still seems to still retain information regarding the outlier cells in the latent space. My concern is this also affects the batch conditioning, normalization, and DEG. Similarly, in order to perform DE I would need to do some fancy indexing to exclude the outlier cells. Would be much easier and cleaner if it was possible to update the model.

cane11 · March 23, 2024, 9:22pm

You can pass indices to most downstream tasks or an AnnData object. If you pass your filtered objects you will get the results on only those cells. In the DE function there is an ‘importance’ weighting. Enabling this downweights the effect that these outliers will have without removing those. This downweighting is described in: https://www.pnas.org/doi/full/10.1073/pnas.2209124120

Topic		Replies	Views
Scvi and subtyping scvi-tools integration , scvi	1	385	February 9, 2024
Scvi model training after sub-clustering scvi-tools integration , scvi	1	218	April 25, 2024
Scvi - denoising single-cell/single-nucleus transcription data scvi-tools scvi	3	239	August 8, 2024
Doublet removal: solo model train error scvi-tools scvi	13	115	August 29, 2024
Fine-Tuning scVI Model Help scvi	2	99	October 23, 2024

Remove outliers from trained scVI model

Related topics