kvfrans / splusLinks
β115Updated 2 months ago
Alternatives and similar repositories for splus
Users that are interested in splus are comparing it to the libraries listed below
Sorting:
- πSmall Batch Size Training for Language Modelsβ42Updated 2 weeks ago
- A simple library for scaling up JAX programsβ143Updated 9 months ago
- Implementation of PSGD optimizer in JAXβ34Updated 7 months ago
- Flow-matching algorithms in JAXβ101Updated last year
- β31Updated 9 months ago
- A simple, performant and scalable JAX-based world modeling codebaseβ70Updated this week
- Maximal Update Parametrization (ΞΌP) with Flax & Optax.β16Updated last year
- LoRA for arbitrary JAX models and functionsβ141Updated last year
- WIPβ94Updated last year
- π§± Modula software packageβ222Updated 3 weeks ago
- supporting pytorch FSDP for optimizersβ84Updated 8 months ago
- Minimal but scalable implementation of large language models in JAXβ35Updated last month
- If it quacks like a tensor...β58Updated 9 months ago
- [ICLR'25] Artificial Kuramoto Oscillatory Neuronsβ96Updated last week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAXβ86Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adamβ85Updated last year
- Minimal yet performant LLM examples in pure JAXβ148Updated this week
- β65Updated 9 months ago
- Pytorch-like dataloaders for JAX.β94Updated 2 months ago
- Accelerated First Order Parallel Associative Scanβ187Updated last year
- β208Updated 8 months ago
- Scalable and Stable Parallelization of Nonlinear RNNSβ19Updated 6 months ago
- β51Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorchβ96Updated 3 weeks ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT trainingβ130Updated last year
- Run PyTorch in JAX. π€β277Updated this week
- Jax/Flax rewrite of Karpathy's nanoGPTβ59Updated 2 years ago
- β56Updated 10 months ago
- β69Updated last year
- Efficient optimizersβ254Updated 3 weeks ago