Freezing layers: register_hook vs. requires_grad

Hello,

Looking at the way scArches module freezing is implemented in `_archesmixin.py` in scverse/scvi-tools (commit 096099e34568b1bf94aad5273b9f92c202c9c755):

    for key, mod in module.named_modules():
        # skip over protected modules
        if key.split(".")[0] in mod_no_hooks_yes_grad:
            continue
        if isinstance(mod, FCLayers):
            hook_first_layer = False if no_hook_cond(key) else True
            mod.set_online_update_hooks(hook_first_layer)

It looks like `set_online_update_hooks` registers hooks that zero out the backward gradients, effectively freezing the `FCLayers` modules. If so, why do this in addition to the usual `requires_grad = False` setting for freezing layers? If not, what is it doing in the above call?
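For context, my (possibly incorrect) mental model is that such a hook would do something like the following minimal sketch, which seems equivalent to just freezing the whole tensor:

    import torch
    import torch.nn as nn

    layer = nn.Linear(4, 3)

    # Option A: the usual way to freeze -- no gradient is computed at all.
    layer.weight.requires_grad_(False)

    # Option B (what I imagine the hook does): the gradient is still
    # computed during backward, then replaced with zeros before the
    # optimizer ever sees it.
    layer.weight.requires_grad_(True)
    layer.weight.register_hook(lambda grad: torch.zeros_like(grad))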

Thank you in advance for teaching me,
Yuge

The issue is that the input dims corresponding to the non-batch and the batch categorical information live in the same linear layer. Decomposing that layer into two would slow down the model's forward pass in PyTorch (though JAX could make this simpler).

Since `requires_grad = False` can only freeze an entire parameter tensor, not a slice of it, the gradient hook is applied instead: it zeroes the gradient on all but the appropriate input dims, so only those dims receive updates.
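To make that concrete, here is a minimal sketch with hypothetical dims (not the actual scvi-tools code; the real logic lives in `FCLayers.set_online_update_hooks`). A hook registered with `register_hook` can zero just a column slice of the weight gradient, leaving the batch-covariate dims trainable:

    import torch
    import torch.nn as nn

    n_genes, n_batch = 100, 4  # hypothetical sizes
    layer = nn.Linear(n_genes + n_batch, 128)

    def zero_gene_dims(grad: torch.Tensor) -> torch.Tensor:
        # Weight is (out_features, in_features); columns are input dims.
        # Zero the gradient for the gene dims, keep it for the batch
        # one-hot dims so new batch categories can still be learned.
        grad = grad.clone()
        grad[:, :n_genes] = 0.0
        return grad

    layer.weight.register_hook(zero_gene_dims)

    x = torch.randn(8, n_genes + n_batch)
    layer(x).sum().backward()
    assert layer.weight.grad[:, :n_genes].abs().sum() == 0  # frozen dims
    assert layer.weight.grad[:, n_genes:].abs().sum() > 0   # trainable dims

With `requires_grad = False` this per-column behavior is impossible without splitting the layer in two, which is exactly the decomposition that would slow down the forward pass.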