lindermanlab / elkLinks
Scalable and Stable Parallelization of Nonlinear RNNS
☆17Updated 5 months ago
Alternatives and similar repositories for elk
Users that are interested in elk are comparing it to the libraries listed below
Sorting:
- Implementation of PSGD optimizer in JAX☆33Updated 6 months ago
- ☆51Updated last year
- ☆110Updated last month
- ☆32Updated 9 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆91Updated 4 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆84Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Updated last week
- Maximal Update Parametrization (μP) with Flax & Optax.☆11Updated last year
- Parallelizing non-linear sequential models over the sequence length☆52Updated 3 weeks ago
- ☆53Updated 9 months ago
- ☆31Updated 7 months ago
- The Energy Transformer block, in JAX☆57Updated last year
- A MAD laboratory to improve AI architecture designs 🧪☆123Updated 6 months ago
- Pytorch-like dataloaders for JAX.☆90Updated last month
- Comparison between GFlowNets & Maximum Entropy RL☆18Updated last year
- 🧱 Modula software package☆204Updated 3 months ago
- seqax = sequence modeling + JAX☆165Updated last month
- A simple library for scaling up JAX programs☆139Updated 8 months ago
- supporting pytorch FSDP for optimizers☆82Updated 7 months ago
- LoRA for arbitrary JAX models and functions☆140Updated last year
- nanoGPT using Equinox☆13Updated 2 years ago
- Accelerated First Order Parallel Associative Scan☆182Updated 10 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆95Updated last month
- A State-Space Model with Rational Transfer Function Representation.☆79Updated last year
- ☆53Updated last year
- ☆164Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆147Updated 2 weeks ago
- Experiments on the impact of depth in transformers and SSMs.☆32Updated 8 months ago
- ☆40Updated last year
- Graph neural networks in JAX.☆67Updated last year