bhosmer / mm

☆138

Alternatives and similar repositories for mm

Users who are interested in mm compare it to the libraries listed below.

srush / raspy
An interactive exploration of Transformer programming.
☆271 Updated 2 years ago
jxbz / agd
Automatic gradient descent
☆217 Updated 2 years ago
HenryNdubuaku / nanodl
A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.
☆300 Updated last year
RobertRiachi / nanoPALM
☆144 Updated 2 years ago
joey00072 / Tinytorch
A really tiny autograd engine
☆99 Updated 8 months ago
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆70 Updated last year
vdesai2014 / inference-optimization-blog-post
☆91 Updated last year
lucidrains / flash-attention-jax
Implementation of Flash Attention in Jax
☆225 Updated last year
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆198 Updated last year
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆82 Updated 2 years ago
google / jaxonnxruntime
A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.
☆131 Updated last week
maxencefaldor / cax
Cellular Automata Accelerated in JAX (Oral at ICLR 2025)
☆243 Updated 2 months ago
vtabbott / Algebraic-NCD
A package for defining deep learning models using categorical algebraic expressions.
☆61 Updated last year
AI-Hypercomputer / maxdiffusion
☆307 Updated this week
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆216 Updated 2 years ago
sumo43 / loopvlm
run paligemma in real time
☆133 Updated last year
google-deepmind / tracr
☆551 Updated 2 years ago
sangmichaelxie / cs324_p2
Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)
☆105 Updated 2 years ago
rasbt / pytorch-memory-optim
This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…
☆92 Updated 2 years ago
dshah3 / GPU-Puzzles
Solve puzzles. Learn CUDA.
☆63 Updated 2 years ago
NVIDIA / JAX-Toolbox
JAX-Toolbox
☆382 Updated this week
allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆269 Updated 9 months ago
irregular-rhomboid / EAI-Math-Reading-Group
Resources from the EleutherAI Math Reading Group
☆54 Updated 11 months ago
srush / GPTWorld
A puzzle to learn about prompting
☆135 Updated 2 years ago
NousResearch / StripedHyenaTrainer
☆62 Updated 2 years ago
itsdaniele / jeometric
Graph neural networks in JAX.
☆68 Updated last year
google-deepmind / nanodo
☆291 Updated last year
divyamakkar0 / JAXformer
A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.
☆115 Updated last month
mxbi / arckit
Tools for working with the Abstraction & Reasoning Corpus
☆215 Updated 5 months ago
jax-ml / jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
☆439 Updated this week