packquickly / schedule_free_optx
Schedule free optimiser implemented in JAX using Optimistix
☆14 · Updated last year
Alternatives and similar repositories for schedule_free_optx
Users that are interested in schedule_free_optx are comparing it to the libraries listed below
- nanoGPT using Equinox ☆13 · Updated 2 years ago
- Einsum-like high-level array sharding API for JAX ☆35 · Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆32 · Updated 3 months ago
- LoRA for arbitrary JAX models and functions ☆142 · Updated last year
- If it quacks like a tensor... ☆59 · Updated 10 months ago
- ☆119 · Updated 3 months ago
- A simple library for scaling up JAX programs ☆143 · Updated 10 months ago
- Code for the paper "Function-Space Learning Rates" ☆23 · Updated 3 months ago
- ☆19 · Updated 4 months ago
- 🧱 Modula software package ☆237 · Updated last month
- ☆40 · Updated last year
- ☆17 · Updated last year
- Pytorch-like dataloaders for JAX. ☆94 · Updated 3 months ago
- Minimal yet performant LLM examples in pure JAX ☆158 · Updated last week
- ☆34 · Updated last year
- ☆28 · Updated last year
- ☆67 · Updated 10 months ago
- Minimal, lightweight JAX implementations of popular models. ☆108 · Updated this week
- ☆34 · Updated 9 months ago
- Minimal but scalable implementation of large language models in JAX ☆35 · Updated 2 weeks ago
- supporting pytorch FSDP for optimizers ☆84 · Updated 9 months ago
- ☆57 · Updated 11 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆88 · Updated last year
- H-Net Dynamic Hierarchical Architecture ☆79 · Updated last week
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 · Updated last year
- Run PyTorch in JAX. 🤝 ☆290 · Updated 2 weeks ago
- JAX implementation of the Mistral 7b v0.2 model ☆36 · Updated last year
- Turn jitted jax functions back into python source code ☆22 · Updated 9 months ago
- ☆22 · Updated 10 months ago
- Neural Networks for JAX ☆84 · Updated 11 months ago