Posterior probability of being assigned to a specific label

Florian_Deckert · July 7, 2021, 3:04pm

Hi, many thanks for your great tool!

I tried using the seed labeling on the Tusi 2018 data and their marker genes from Supp table 1. I followed strictly your tutorial (https://docs.scvi-tools.org/en/stable/user_guide/notebooks/seed_labeling.html) and the first results look promising. However, I would like to get the posterior probability score which you used in your original publication (e.g. Figure 6 D).

On that note, is there a implemented way to dismiss labels if they have no support based on the trained model?

Is it possible to extract it from the model? I tried dir(scvi.model._scanvi.SCANVI) but could not find.

Many thanks,
Florian

adamgayoso · July 7, 2021, 5:11pm

So I think you just want to add soft=True to the predict method of SCANVI. However, we do have a bug in this where it’s not correctly outputting a dataframe.

github.com/YosefLab/scvi-tools

The scANVI prediction results in an array, not a data frame.

opened 09:41AM - 30 Jun 21 UTC

liuzj039

bug

Thank developers for bringing such a powerful tool. When using scANVI's model… to predict results, if the parameter is set to soft, an array will be returned instead of a dataframe ```python scanviModel.predict(soft=True) ``` This appears to have been caused by a typo. https://github.com/YosefLab/scvi-tools/blob/bc415d8a48195cbe7d145abd240961d8b5a10b0d/scvi/model/_scanvi.py#L299 The return value here should be `pred` instead of `y_pred`

Florian_Deckert · July 9, 2021, 12:58pm

Hello Adam, many thanks for your reply! So I added the following lines:

y_pred = scanvi_model.predict(adata, soft=True)
pred = pd.DataFrame(data=y_pred[0:,0:])
pred_score = pred.max(axis=1).to_numpy()

I assume that pred_score is now the maximum score across all labels for each cell. Which should correspond to the label assigned to that cell.

Background: I annotated progenitor cells with SingleR and use the top 10 SingleR labels as seed labels with scanvi. That is the comparison of the scvi vs SingleR score.

adamgayoso · July 15, 2021, 10:57pm

Your process is correct. We just released the new version which fixes the issue with the soft prediction, so I recommend updating to it.

Florian_Deckert · July 28, 2021, 7:00am

Great, many thanks for your tools and dedication!

Topic		Replies	Views
Scanvi best practices scvi-tools scanvi	6	1602	November 28, 2022
SCANVI soft labeling scvi-tools scanvi	1	883	August 8, 2022
Label Transfer Discrepancy in scANVI Model Training scvi-tools	2	405	January 22, 2024
Label transfer with SCVI-SCANVI pipeline changes (predicts wrong) labels in ref data scvi-tools scanvi , scvi	8	1009	July 31, 2023
Error on scvi_model.train(100) for seed label transfer scvi-tools scanvi	1	2106	June 8, 2022

Posterior probability of being assigned to a specific label

Related topics