bhosmer / mm
☆136 Updated 9 months ago
Alternatives and similar repositories for mm
Users that are interested in mm are comparing it to the libraries listed below
- An interactive exploration of Transformer programming. ☆267 Updated last year
- ☆275 Updated last year
- Solve puzzles. Learn CUDA. ☆64 Updated last year
- Automatic gradient descent ☆208 Updated 2 years ago
- A package for defining deep learning models using categorical algebraic expressions. ☆61 Updated last year
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆290 Updated 11 months ago
- ☆243 Updated this week
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆76 Updated last year
- JAX-Toolbox ☆329 Updated this week
- ☆143 Updated 2 years ago
- A really tiny autograd engine ☆95 Updated 2 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆190 Updated last year
- A puzzle to learn about prompting ☆132 Updated 2 years ago
- σ-GPT: A New Approach to Autoregressive Models ☆67 Updated 11 months ago
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs ☆466 Updated this week
- Graph neural networks in JAX. ☆67 Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆119 Updated last week
- seqax = sequence modeling + JAX ☆165 Updated 2 weeks ago
- Website for hosting the Open Foundation Models Cheat Sheet. ☆267 Updated 3 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆35 Updated 9 months ago
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆135 Updated 4 months ago
- JAX implementation of the Llama 2 model ☆219 Updated last year
- Modular, scalable library to train ML models ☆143 Updated this week
- ☆136 Updated 4 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆286 Updated last week
- Named Tensors for Legible Deep Learning in JAX ☆199 Updated last week
- run paligemma in real time ☆131 Updated last year
- Resources from the EleutherAI Math Reading Group ☆53 Updated 5 months ago
- Functional local implementations of main model parallelism approaches ☆96 Updated 2 years ago
- JAX implementation of the Mistral 7b v0.2 model ☆35 Updated last year