bhosmer / mm
☆131 Updated 8 months ago
Alternatives and similar repositories for mm
Users who are interested in mm are comparing it to the libraries listed below.
- An interactive exploration of Transformer programming. ☆265 Updated last year
- Automatic gradient descent ☆208 Updated 2 years ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆290 Updated 10 months ago
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs ☆424 Updated this week
- Solve puzzles. Learn CUDA. ☆64 Updated last year
- A puzzle to learn about prompting ☆131 Updated 2 years ago
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆76 Updated 11 months ago
- A package for defining deep learning models using categorical algebraic expressions. ☆61 Updated 11 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆65 Updated 11 months ago
- ☆273 Updated last year
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025) ☆205 Updated 2 months ago
- Alex Krizhevsky's original code from Google Code ☆194 Updated 9 years ago
- ☆143 Updated 2 years ago
- ☆230 Updated this week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆190 Updated last year
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ☆105 Updated 2 years ago
- ☆54 Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆115 Updated 2 weeks ago
- Implementation of Flash Attention in Jax ☆213 Updated last year
- ☆134 Updated 3 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆280 Updated this week
- Repository for code used in the xVal paper ☆136 Updated last year
- JAX-Toolbox ☆321 Updated this week
- Getting crystal-like representations with harmonic loss ☆191 Updated 3 months ago
- ☆88 Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet. ☆267 Updated 2 months ago
- Gpu benchmark ☆63 Updated 5 months ago
- train with kittens! ☆61 Updated 8 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆34 Updated 8 months ago
- 🧱 Modula software package ☆204 Updated 3 months ago