bhosmer / mm
☆136 Updated 10 months ago
Alternatives and similar repositories for mm
Users who are interested in mm are comparing it to the libraries listed below.
- An interactive exploration of Transformer programming. ☆269 Updated last year
- Automatic gradient descent ☆210 Updated 2 years ago
- σ-GPT: A New Approach to Autoregressive Models ☆67 Updated last year
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆291 Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆76 Updated last year
- ☆281 Updated last year
- A really tiny autograd engine ☆94 Updated 3 months ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025) ☆215 Updated 4 months ago
- A package for defining deep learning models using categorical algebraic expressions. ☆61 Updated last year
- ☆142 Updated 2 weeks ago
- ☆89 Updated last year
- ☆261 Updated this week
- seqax = sequence modeling + JAX ☆167 Updated last month
- JAX-Toolbox ☆337 Updated this week
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ☆105 Updated 2 years ago
- Implementation of Flash Attention in Jax ☆217 Updated last year
- ☆144 Updated 2 years ago
- Solve puzzles. Learn CUDA. ☆63 Updated last year
- Modular, scalable library to train ML models ☆166 Updated this week
- Functional local implementations of main model parallelism approaches ☆96 Updated 2 years ago
- run paligemma in real time ☆132 Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆192 Updated last year
- Resources from the EleutherAI Math Reading Group ☆54 Updated 6 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆123 Updated this week
- A puzzle to learn about prompting ☆135 Updated 2 years ago
- 🧱 Modula software package ☆237 Updated last month
- ☆30 Updated last year
- Neural Networks for JAX ☆84 Updated 11 months ago
- Graph neural networks in JAX. ☆67 Updated last year
- Python bindings for ggml ☆146 Updated last year