edouardoyallon / acco
ACCO: An optimization algorithm for sharded distributed LLM training.
☆11Updated last month
Alternatives and similar repositories for acco
Users who are interested in acco are comparing it to the libraries listed below.
- ☆190Updated 6 months ago
- Modern Fixed Point Systems using Pytorch☆94Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆177Updated 2 weeks ago
- PyTorch linear operators for curvature matrices (Hessian, Fisher/GGN, KFAC, ...)☆40Updated 2 months ago
- ☆270Updated 11 months ago
- IVON optimizer for neural networks based on variational learning.☆68Updated 7 months ago
- 🧱 Modula software package☆200Updated 3 months ago
- Efficient optimizers☆220Updated last week
- A library for unit scaling in PyTorch☆125Updated 6 months ago
- Named tensors with first-class dimensions for PyTorch☆332Updated 2 years ago
- CLU lets you write beautiful training loops in JAX.☆346Updated this week
- JAX Arrays for human consumption☆93Updated last week
- LoRA for arbitrary JAX models and functions☆139Updated last year
- JAX Synergistic Memory Inspector☆174Updated 11 months ago
- ☆11Updated 11 months ago
- Unofficial JAX implementations of deep learning research papers☆156Updated 3 years ago
- Implementation of PSGD optimizer in JAX☆33Updated 5 months ago
- Flow-matching algorithms in JAX☆97Updated 10 months ago
- Run PyTorch in JAX. 🤝☆253Updated 4 months ago
- A simple library for scaling up JAX programs☆139Updated 7 months ago
- JMP is a Mixed Precision library for JAX.☆203Updated 4 months ago
- Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…☆246Updated this week
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆188Updated 6 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆381Updated 2 months ago
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024☆109Updated 7 months ago
- ☆157Updated 10 months ago
- Collaborative documentation for and from Jean Zay users. Official Jean Zay documentation: http://www.idris.fr/eng/jean-zay/☆123Updated 11 months ago
- Agustinus' very opinionated publication-ready plotting library☆66Updated last month
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆110Updated 2 months ago
- WIP☆93Updated 10 months ago