srush / LLM-Training-PuzzlesLinks

What would you do with 1000 H100s...

☆1,132

Alternatives and similar repositories for LLM-Training-Puzzles

Users that are interested in LLM-Training-Puzzles are comparing it to the libraries listed below

Sorting:

srush / Transformer-Puzzles
Puzzles for exploring transformers
☆380Updated 2 years ago
srush / Triton-Puzzles
Puzzles for learning Triton
☆2,153Updated last year
HazyResearch / aisys-building-blocks
Building blocks for foundation models.
☆581Updated last year
srush / Autodiff-Puzzles
☆460Updated last year
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆829Updated 4 months ago
rwitten / HighPerfLLMs2024
☆545Updated last year
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆546Updated last month
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,911Updated 3 months ago
gpu-mode / resource-stream
GPU programming related news and material links
☆1,825Updated 2 months ago
BobMcDear / attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
☆584Updated 3 months ago
srush / annotated-mamba
Annotated version of the Mamba paper
☆491Updated last year
srush / awesome-o1
A bibliography and survey of the papers surrounding o1
☆1,214Updated last year
jax-ml / scaling-book
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
☆710Updated last week
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,351Updated 2 weeks ago
marin-community / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆685Updated last week
marin-community / marin
Open-source framework for the research and development of foundation models.
☆648Updated this week
meta-pytorch / attention-gym
Helpful tools and examples for working with flex-attention
☆1,072Updated this week
lucidrains / ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
☆548Updated 6 months ago
ayaka14732 / tpu-starter
Everything you want to know about Google Cloud TPU
☆552Updated last year
evanmiller / LLM-Reading-List
LLM papers I'm reading, mostly on inference and model compression
☆747Updated last year
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,015Updated last year
haoliuhl / ringattention
Large Context Attention
☆753Updated last month
stanford-cs336 / spring2024-lectures
☆403Updated 11 months ago
gpu-mode / awesomeMLSys
An ML Systems Onboarding list
☆945Updated 10 months ago
clu0 / unet.cu
UNet diffusion model in pure CUDA
☆654Updated last year
JonasGeiping / cramming
Cramming the training of a (BERT-type) language model into limited compute.
☆1,353Updated last year
Quentin-Anthony / torch-profiling-tutorial
☆532Updated 4 months ago
mlfoundations / open_lm
A repository for research on medium sized language models.
☆521Updated 6 months ago
huggingface / picotron_tutorial
☆224Updated last week
bkitano / llama-from-scratch
Llama from scratch, or How to implement a paper without crying
☆581Updated last year