g-benton / hessian-eff-dimLinks
Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited
☆37Updated 2 years ago
Alternatives and similar repositories for hessian-eff-dim
Users that are interested in hessian-eff-dim are comparing it to the libraries listed below
Sorting:
- An empirical investigation of deep learning theory☆16Updated 6 years ago
- Geometric Certifications of Neural Nets☆42Updated 2 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆57Updated 4 years ago
- ☆50Updated 4 years ago
- Monotone operator equilibrium networks☆53Updated 5 years ago
- Limitations of the Empirical Fisher Approximation☆48Updated 7 months ago
- Codebase for Learning Invariances in Neural Networks☆96Updated 3 years ago
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020☆14Updated 5 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- ☆54Updated last year
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆47Updated 6 years ago
- Toy datasets to evaluate algorithms for domain generalization and invariance learning.☆42Updated 3 years ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆14Updated 6 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆35Updated 5 years ago
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- ☆25Updated 5 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 6 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Updated 6 years ago
- Hybrid Discriminative-Generative Training via Contrastive Learning☆75Updated 2 years ago
- ☆20Updated 5 years ago
- ☆30Updated 4 years ago
- ☆100Updated 3 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆41Updated 4 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- ☆55Updated 5 years ago
- ☆37Updated 2 years ago
- Reparameterize your PyTorch modules☆71Updated 4 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- ☆34Updated 4 years ago
- The original code for the paper "How to train your MAML" along with a replication of the original "Model Agnostic Meta Learning" (MAML) p…☆41Updated 4 years ago