Sorry if this is a naive question, but I’m trying to check my understanding of the PROGENy approach to pathway analysis, and scverse/decoupleR’s implementation of it.
In the original PROGENy paper, I see that the PROGENy weights (the values in the model in scverse, right?) are regression coefficients that they found from fitting “pathway activity” vs “per-gene expression levels”.
(And when they did their fit, the “pathway activities” were 0 or 1, and the expressions were z-scores).
That would align with how scverse/decoupleR is getting PROGENy pathway activities from expression data (running an mlm, and using the t-values as the activities). So I think that makes sense to me.
But, in the “PROGENy Scores” section of the paper, they seem to explain that you should get the “pathway scores” by simple matrix multiplication (i.e. weighted sum of the gene expression, weighted by the PROGENy weights). And this approach seems to agree with usage of PROGENy that I see in the literature (e.g. this paper from the Saez-Rodriguez group from 2020)
Are these two approaches actually equivalent, and I’m not seeing it? Or are they answering two different questions, and I need to be careful which approach I take?
(Thanks for such a great ecosystem of tools, BTW!)