edouardoyallon / acco
ACCO: An optimization algorithm for sharded distributed LLM training.
☆12 Updated 7 months ago
Alternatives and similar repositories for acco
Users who are interested in acco are comparing it to the libraries listed below.
- ☆234 Updated last year
- Library for reading and processing ML training data. ☆647 Updated this week
- CLU lets you write beautiful training loops in JAX. ☆365 Updated this week
- WoodTapper: a Python toolbox for interpretable and explainable tree ensembles. ☆22 Updated last month
- Implementation of https://srush.github.io/annotated-s4 ☆510 Updated 6 months ago
- Efficient optimizers ☆280 Updated 3 weeks ago
- For optimization algorithm research and development. ☆556 Updated 3 weeks ago
- Orbax provides common checkpointing and persistence utilities for JAX users ☆474 Updated this week
- Universal Notation for Tensor Operations in Python. ☆459 Updated 9 months ago
- 🧱 Modula software package ☆322 Updated 4 months ago
- JAX Synergistic Memory Inspector ☆183 Updated last year
- Run PyTorch in JAX. 🤝 ☆309 Updated 3 months ago
- ☆261 Updated 2 weeks ago
- Named tensors with first-class dimensions for PyTorch ☆332 Updated 2 years ago
- ☆287 Updated last year
- Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)… ☆292 Updated this week
- IVON optimizer for neural networks based on variational learning. ☆80 Updated last year
- ☆233 Updated 11 months ago
- Pytorch-like dataloaders for JAX. ☆98 Updated 3 weeks ago
- Unofficial JAX implementation of the SOAP optimizer (https://arxiv.org/abs/2409.11321) ☆22 Updated this week
- Implementation of PSGD optimizer in JAX ☆35 Updated last year
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024 ☆125 Updated last year
- PyTorch linear operators for curvature matrices (Hessian, Fisher/GGN, KFAC, ...) ☆61 Updated this week
- jax-triton contains integrations between JAX and OpenAI Triton ☆436 Updated last month
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆188 Updated 2 weeks ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆688 Updated 2 weeks ago
- Second Order Optimization and Curvature Estimation with K-FAC in JAX. ☆304 Updated this week
- Code for our NeurIPS 2022 paper ☆371 Updated 3 years ago
- Minimal yet performant LLM examples in pure JAX ☆226 Updated last week
- JMP is a Mixed Precision library for JAX. ☆210 Updated 11 months ago