google / qwixLinks
a Jax quantization library
☆35Updated last week
Alternatives and similar repositories for qwix
Users that are interested in qwix are comparing it to the libraries listed below
Sorting:
- Minimal yet performant LLM examples in pure JAX☆158Updated this week
- A JAX-native LLM Post-Training Library☆143Updated this week
- ☆65Updated 10 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆123Updated last month
- A simple library for scaling up JAX programs☆143Updated 10 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆36Updated last week
- ☆188Updated 2 weeks ago
- LoRA for arbitrary JAX models and functions☆142Updated last year
- ☆34Updated 9 months ago
- ☆261Updated this week
- seqax = sequence modeling + JAX☆167Updated last month
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- Two implementations of ZeRO-1 optimizer sharding in JAX☆14Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Updated last month
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 3 months ago
- ☆34Updated last year
- A simple, performant and scalable JAX-based world modeling codebase☆73Updated this week
- Focused on fast experimentation and simplicity☆75Updated 8 months ago
- JAX implementation of the Llama 2 model☆219Updated last year
- ☆118Updated 3 months ago
- 📄Small Batch Size Training for Language Models☆60Updated 3 weeks ago
- Implementation of Flash Attention in Jax☆216Updated last year
- FlashRNN - Fast RNN Kernels with I/O Awareness☆97Updated 3 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- JAX bindings for Flash Attention v2☆91Updated last week
- Machine Learning eXperiment Utilities☆47Updated last month
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆71Updated 5 months ago
- A set of Python scripts that makes your experience on TPU better☆54Updated last year
- JAX-Toolbox☆335Updated this week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆88Updated last year