The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size
☆19May 19, 2019Updated 6 years ago
Alternatives and similar repositories for DeepnetHessian
Users that are interested in DeepnetHessian are comparing it to the libraries listed below
Sorting:
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆16Apr 27, 2019Updated 6 years ago
- Hessian spectral density estimation in TF and Jax☆125Sep 6, 2020Updated 5 years ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆21Feb 19, 2026Updated 2 weeks ago
- ☆29Nov 30, 2025Updated 3 months ago
- ☆30Feb 11, 2021Updated 5 years ago
- ☆15Apr 19, 2020Updated 5 years ago
- Regularization, Neural Network Training Dynamics☆14Jan 13, 2020Updated 6 years ago
- ☆16Jun 3, 2025Updated 9 months ago
- Wrap around any model to output differentially private prediction sets with finite sample validity on any dataset.☆18Mar 3, 2024Updated 2 years ago
- Efficient empirical NTKs in PyTorch☆22Jun 13, 2022Updated 3 years ago
- ICML 2020, Estimating Generalization under Distribution Shifts via Domain-Invariant Representations☆23Jun 30, 2020Updated 5 years ago
- ☆25Feb 20, 2026Updated 2 weeks ago
- ☆46Jul 21, 2025Updated 7 months ago
- Official PyTorch code release for Implicit Gradient Transport, NeurIPS'19☆21Jun 11, 2019Updated 6 years ago
- ☆83Jan 15, 2020Updated 6 years ago
- Benchmarking Optimizers for LLM Pretraining☆54Dec 30, 2025Updated 2 months ago
- A curated list of resources to help with computational research.☆20Jun 11, 2022Updated 3 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Jan 12, 2019Updated 7 years ago
- ☆11Jun 3, 2025Updated 9 months ago
- Code to reproduce experiments in "Antipodes of Label Differential Privacy PATE and ALIBI"☆32Apr 25, 2022Updated 3 years ago
- ☆28Oct 21, 2022Updated 3 years ago
- Learning protein structure with a differentiable simulator☆27Jul 8, 2019Updated 6 years ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆63Feb 26, 2026Updated last week
- Addressing the problem of predicting crime occurrence based on historic records☆11Nov 27, 2019Updated 6 years ago
- ☆56Sep 17, 2025Updated 5 months ago
- ☆84Aug 31, 2023Updated 2 years ago
- Efficient PyTorch Hessian eigendecomposition tools!☆386Feb 29, 2024Updated 2 years ago
- ☆13Jun 18, 2025Updated 8 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Hybrid Discriminative-Generative Training via Contrastive Learning☆75May 1, 2023Updated 2 years ago
- Public code for a paper "Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks."☆35Dec 18, 2018Updated 7 years ago
- ☆37Mar 16, 2022Updated 3 years ago
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆48Apr 9, 2021Updated 4 years ago
- Computing various measures and generalization bounds on convolutional and fully connected networks☆35Dec 13, 2018Updated 7 years ago
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025☆15Nov 25, 2025Updated 3 months ago
- ☆11Jul 20, 2021Updated 4 years ago
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"☆12Feb 8, 2023Updated 3 years ago
- JMLR Cover Letter Template☆10Dec 15, 2021Updated 4 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)☆12Jul 10, 2022Updated 3 years ago