Hi, I am trying to use lung cancer single cell atlas (https://luca.icbi.at/) to annotate a Xenium experiment. Given Xenium is gene-panel based (~400 genes), I retrained atlas by subsetting features to 400, first training scvi model to integrate, then using semi-supervised scanvi model with cell type as label key. For both model, I trained 300 epochs. However, when examining the model metrics, I noticed elbo loss decrease but reconstruction loss increase. Here are the plots
I am wondering:
- Is my planned workflow (subset features - reference scvi training- reference scanvi training - query-reference mapping) valid?
- how do we interpret “reconstruction loss increase”? I am a little suspicious that the features from Xenium gene panel are not good to represent cell type. I am wondering whether this can be the reason.
- With this quality of training, can i use the model to do reference mapping for Xenium data?
Thank you!!!