bhosmer / mm
☆130 Updated 5 months ago
Alternatives and similar repositories for mm:
Users who are interested in mm are comparing it to the libraries listed below.
- A really tiny autograd engine ☆91 Updated last year
- An interactive exploration of Transformer programming. ☆262 Updated last year
- A Jax-based library for designing and training small transformers. ☆286 Updated 7 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆62 Updated 8 months ago
- A package for defining deep learning models using categorical algebraic expressions. ☆60 Updated 8 months ago
- Solve puzzles. Learn CUDA. ☆63 Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆73 Updated 8 months ago
- Named Tensors for Legible Deep Learning in JAX ☆172 Updated this week
- ☆215 Updated 9 months ago
- jax-triton contains integrations between JAX and OpenAI Triton ☆390 Updated last week
- 🧱 Modula software package ☆188 Updated 3 weeks ago
- ☆87 Updated last year
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs ☆242 Updated this week
- Automatic gradient descent ☆207 Updated last year
- Gpu benchmark ☆59 Updated 2 months ago
- Graph neural networks in JAX. ☆67 Updated 10 months ago
- ☆128 Updated 2 weeks ago
- ☆60 Updated 3 years ago
- JAX implementation of the Llama 2 model ☆218 Updated last year
- Inference of Mamba models in pure C ☆187 Updated last year
- Implementation of Flash Attention in Jax ☆206 Updated last year
- Resources from the EleutherAI Math Reading Group ☆53 Updated last month
- Experiment of using Tangent to autodiff triton ☆78 Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX ☆270 Updated 10 months ago
- ☆201 Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆38 Updated this week
- JAX-Toolbox ☆298 Updated this week
- ☆77 Updated 9 months ago
- ☆99 Updated this week
- Puzzles for exploring transformers ☆343 Updated last year