Hello!
I recently upgraded scvi from version 1.1.2 to 1.3.0 and attempted to reproduce a one vs all DE test across different cell types. I got drastically different results. I noticed in the release notes that there were changes made to the implementation of the DE test and was wondering if those changes affected the results. I saw that the default mode was changed to “change” but I had been manually setting mode=“change” all along, so I don’t believe this is the source of the difference. Here is the code I used for the original results (with version 1.1.2) and the new results (with version 1.3.0) (same code for both).
de_df = model.differential_expression(
adata=adata_filtered,
groupby='cell_type',
mode = "change",
delta = 0.25,
fdr_target = 0.05
)
The original results (version 1.1.2) of the one vs all DE test seemed to be pretty accurate reflections of known marker genes across cell types. The new results don’t seem to be as accurate.
One cell type that had drastically different DE results between the two package versions, which I will use as an example here, is Tfh. In the original results, over 2,000 of the 5,000 highly variable genes in my dataset were differentially expressed at an fdr of 0.05. In the new results, only 3 genes were differentially expressed. In general, the new results show lower bayes factors and lower lfc medians. I want to clarify that the non-zeros proportions for all cell types are identical between the original results and the new results so I am confident that there was not any mixup on my end of different input data. I will attach the results for 3 marker genes that showed up as differentially expressed in the original results but that did not in the new results to show the overall trend I’m seeing.
Do you have any recommendations for what to do? Maybe going back to a previous version of scvi or changing DE test parameters in the current version? Any further insight into interpreting the updates that were made in the latest version would be greatly appreciated.
Thank you!
Julianne