lindermanlab / elkLinks
Scalable and Stable Parallelization of Nonlinear RNNS
☆19Updated 7 months ago
Alternatives and similar repositories for elk
Users that are interested in elk are comparing it to the libraries listed below
Sorting:
- ☆51Updated last year
- ☆56Updated 10 months ago
- ☆32Updated 10 months ago
- ☆115Updated 2 months ago
- Implementation of PSGD optimizer in JAX☆34Updated 7 months ago
- 📄Small Batch Size Training for Language Models☆43Updated this week
- The Energy Transformer block, in JAX☆59Updated last year
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆42Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆87Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆93Updated 5 months ago
- Pytorch-like dataloaders for JAX.☆94Updated 3 months ago
- ☆31Updated 9 months ago
- ☆29Updated 2 weeks ago
- A simple library for scaling up JAX programs☆143Updated 9 months ago
- LoRA for arbitrary JAX models and functions☆142Updated last year
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- Flow-matching algorithms in JAX☆101Updated last year
- supporting pytorch FSDP for optimizers☆84Updated 8 months ago
- ☆34Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆153Updated 2 months ago
- 🧱 Modula software package☆225Updated last week
- ☆35Updated 4 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆97Updated 2 weeks ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14Updated 3 months ago
- ☆207Updated 8 months ago
- Experiments on the impact of depth in transformers and SSMs.☆33Updated 9 months ago
- Running Jax in PyTorch Lightning☆111Updated 8 months ago
- Official Jax Implementation of MD4 Masked Diffusion Models☆123Updated 6 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆125Updated 8 months ago