EleutherAI / cookbookLinks
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆824Updated 3 months ago
Alternatives and similar repositories for cookbook
Users that are interested in cookbook are comparing it to the libraries listed below
Sorting:
- Best practices & guides on how to write distributed pytorch training code☆536Updated 3 weeks ago
- What would you do with 1000 H100s...☆1,124Updated last year
- ☆545Updated last year
- Puzzles for exploring transformers☆376Updated 2 years ago
- System 2 Reasoning Link Collection☆855Updated 8 months ago
- ☆525Updated 3 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,033Updated 6 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,222Updated this week
- ☆457Updated last year
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,892Updated 2 months ago
- Minimalistic large language model 3D-parallelism training☆2,323Updated 2 months ago
- Fast bare-bones BPE for modern tokenizer training☆168Updated 4 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆679Updated last week
- Building blocks for foundation models.☆572Updated last year
- Open-source framework for the research and development of foundation models.☆611Updated this week
- A bibliography and survey of the papers surrounding o1☆1,209Updated last year
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆691Updated this week
- Async RL Training at Scale☆770Updated this week
- Website for hosting the Open Foundation Models Cheat Sheet.☆268Updated 6 months ago
- UNet diffusion model in pure CUDA☆654Updated last year
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆743Updated last week
- ☆791Updated this week
- ☆225Updated 3 weeks ago
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆927Updated this week
- A puzzle to learn about prompting☆134Updated 2 years ago
- Recipes to scale inference-time compute of open models☆1,118Updated 5 months ago
- A repository for research on medium sized language models.☆518Updated 5 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆196Updated 5 months ago
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆231Updated 3 months ago