bhosmer / mm
☆118Updated 2 months ago
Alternatives and similar repositories for mm:
Users that are interested in mm are comparing it to the libraries listed below
- Automatic gradient descent☆206Updated last year
- ☆201Updated 6 months ago
- An interactive exploration of Transformer programming.☆255Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆71Updated 5 months ago
- Implementation of Flash Attention in Jax☆204Updated 10 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆58Updated 5 months ago
- Graph neural networks in JAX.☆67Updated 6 months ago
- Resources from the EleutherAI Math Reading Group☆52Updated last month
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆182Updated 7 months ago
- Solve puzzles. Learn CUDA.☆61Updated last year
- A really tiny autograd engine☆89Updated 9 months ago
- A Jax-based library for designing and training transformer models from scratch.☆280Updated 4 months ago
- ☆143Updated last year
- ☆36Updated last month
- Named Tensors for Legible Deep Learning in JAX☆157Updated last week
- σ-GPT: A New Approach to Autoregressive Models☆61Updated 5 months ago
- A puzzle to learn about prompting☆123Updated last year
- A simple library for scaling up JAX programs☆129Updated 2 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆102Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆19Updated 3 months ago
- Cellular Automata Accelerated in JAX☆77Updated last month
- Run PyTorch in JAX. 🤝☆214Updated last week
- ☆58Updated 2 years ago
- ☆75Updated 6 months ago
- 🧱 Modula software package☆132Updated this week
- Neural Networks for JAX☆83Updated 3 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆184Updated 2 years ago
- Multibackend Graph Neural Networks in Keras 3☆25Updated 11 months ago
- JAX implementation of the Llama 2 model☆213Updated 11 months ago
- ☆52Updated 8 months ago