How to compare different parameter sets using the validation loss?

Hi Yiwei,

I saw the exactly same thing on my dataset, did you find any explanation for this? I saw the other post mentioning:

Best,

swc