Interpretability and training dynamics
Layer-wise interpretability via identity initialization, implicit bias of gradient regularization, and selective forgetting / unlearning.
Findings (3)
Connections
This topic …
contrasts with Dynamical isometry