google / qwix
A JAX quantization library
☆46 Updated this week
Alternatives and similar repositories for qwix
Users who are interested in qwix are comparing it to the libraries listed below.
- Minimal yet performant LLM examples in pure JAX ☆181 Updated 2 weeks ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆38 Updated last month
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆123 Updated 3 weeks ago
- A simple library for scaling up JAX programs ☆143 Updated 11 months ago
- ☆264 Updated this week
- ☆120 Updated 3 months ago
- ☆67 Updated 10 months ago
- LoRA for arbitrary JAX models and functions ☆142 Updated last year
- Modular, scalable library to train ML models ☆165 Updated this week
- JAX-Toolbox ☆343 Updated this week
- Tokamax: A GPU and TPU kernel library. ☆87 Updated this week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆89 Updated last year
- ☆115 Updated last month
- ☆189 Updated last week
- supporting pytorch FSDP for optimizers ☆84 Updated 10 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆19 Updated 2 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- Machine Learning eXperiment Utilities ☆47 Updated 2 months ago
- 📄Small Batch Size Training for Language Models ☆62 Updated last week
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆144 Updated 5 months ago
- ☆91 Updated last year
- Explorations into the recently proposed Taylor Series Linear Attention ☆100 Updated last year
- ☆146 Updated 2 weeks ago
- ☆33 Updated 10 months ago
- Experimenting with how best to do multi-host dataloading ☆10 Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX ☆35 Updated last month
- ☆46 Updated this week
- seqax = sequence modeling + JAX ☆167 Updated 2 months ago
- A simple, performant and scalable JAX-based world modeling codebase ☆76 Updated this week
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels. ☆70 Updated last month