ZQZCalin / trainit
☆11Updated this week
Alternatives and similar repositories for trainit:
Users who are interested in trainit are comparing it to the libraries listed below
- PyTorch linear operators for curvature matrices (Hessian, Fisher/GGN, KFAC, ...)☆36Updated last week
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆21Updated 5 months ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆14Updated 5 months ago
- PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆173Updated this week
- ☆16Updated last year
- Amortized Probabilistic Conditioning for Optimization, Simulation and Inference (Chang et al., AISTATS 2025)☆15Updated this week
- IVON optimizer for neural networks based on variational learning.☆62Updated 5 months ago
- ☆10Updated 3 years ago
- ☆67Updated 4 months ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 9 months ago
- [TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative…☆17Updated last year
- Minimal but scalable implementation of large language models in JAX☆34Updated 5 months ago
- Turn jitted JAX functions back into Python source code☆22Updated 4 months ago
- Parameter-Free Optimizers for PyTorch☆123Updated last year
- Riemannian Optimization Using JAX☆48Updated last year
- ☆17Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆33Updated last year
- 🧱 Modula software package☆188Updated 3 weeks ago
- Sampling with gradient-based Markov Chain Monte Carlo approaches☆99Updated last year
- Agustinus' very opinionated publication-ready plotting library☆64Updated 2 months ago
- ☆52Updated 6 months ago
- PEPit is a package enabling computer-assisted worst-case analyses of first-order optimization methods.☆86Updated 3 months ago
- ☆17Updated 8 months ago
- Hessian trace estimation using PyTorch and Hutch++☆19Updated 4 years ago
- A general-purpose, deep learning-first library for constrained optimization in PyTorch☆115Updated 3 weeks ago
- Limitations of the Empirical Fisher Approximation☆47Updated last month
- PyTorch code for experiments on Linear Transformers☆20Updated last year
- ☆80Updated 3 years ago
- If it quacks like a tensor...☆58Updated 5 months ago
- ☆102Updated this week