← Open in Shiplog

Machine learning

curiosity machine learning

subtree 7 descendants 6 findings 1 note

DNN theory through random-matrix lenses — free random projection, dynamical isometry, meta-RL, training dynamics.

Members (7)

themeDynamical isometry
Conditions under which signal propagation in deep networks preserves norms and gradients; spectral analysis of layerwise Jacobians and Fisher information.
1finding
Featured: The Spectrum of Fisher Information of Deep Networks Achieving Dynamical Isometry
methodOrthogonal initialization
Initializing weight matrices as random orthogonal matrices to preserve singular values.
themeDNN architectures as random-matrix systems
Reading deep architectures (MLP-Mixer, attention, sparse MLPs) through random-matrix and Kronecker-structure lenses to expose implicit regularization.
1finding
Featured: Understanding MLP-Mixer as a Wide and Sparse MLP
methodFree Random Projection
Random representation-based projection method for in-context and meta-reinforcement learning.
1finding
Featured: Free Random Projection for In-Context Reinforcement Learning
themeMeta reinforcement learning
Learning algorithms that adapt to new tasks from limited interaction.
Featured: Free Random Projection for In-Context Reinforcement Learning
themeReinforcement learning
Sequential decision-making under uncertainty — the umbrella over meta-RL adaptive learning and the VR-scene exploration policies in the adjacent thread.
themeInterpretability and training dynamics
Layer-wise interpretability via identity initialization, implicit bias of gradient regularization, and selective forgetting / unlearning.
3findings
Featured: Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias

Findings — subtree (6)

Papers (6)
- Free Random Projection for In-Context Reinforcement Learning2025· AISTATS· via Free Random Projection
- Understanding MLP-Mixer as a Wide and Sparse MLP2024· PMLR· via DNN architectures as random-matrix systems
- Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias2023· ICML· via Interpretability and training dynamics
- Layer-Wise Interpretation of Deep Neural Networks Using Idneity Initialization2021· IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)· via Interpretability and training dynamics
- The Spectrum of Fisher Information of Deep Networks Achieving Dynamical Isometry2021· AISTATS· via Dynamical isometry
- Selective Forgetting of Deep Networks at a Finer Level than Samples2020· AAAI RSEML· via Interpretability and training dynamics

Notes — subtree (1)

AISTATS2026に論文が2本採択されました2026-01-23· announcement· via Free Random Projection

Connections

No topic connections yet.