EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆662Updated last month
Related projects: ⓘ
- Puzzles for learning Triton☆966Updated this week
- What would you do with 1000 H100s...☆816Updated 8 months ago
- System 2 Reasoning Link Collection☆597Updated this week
- Best practices for distilling large language models.☆370Updated 7 months ago
- Minimalistic large language model 3D-parallelism training☆1,111Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆774Updated 3 weeks ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆871Updated this week
- UNet diffusion model in pure CUDA☆562Updated 2 months ago
- LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processin…☆659Updated this week
- Puzzles for exploring transformers☆293Updated last year
- GPT-2 (124M) quality in 5B tokens☆227Updated last week
- An ML Systems Onboarding list☆491Updated last month
- The Tensor (or Array)☆388Updated last month
- The Multilayer Perceptron Language Model☆503Updated last month
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆452Updated last week
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a…☆1,131Updated this week
- Fast bare-bones BPE for modern tokenizer training☆138Updated 3 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆1,935Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆616Updated last week
- Building blocks for foundation models.☆345Updated 8 months ago
- ☆274Updated this week
- Annotated version of the Mamba paper☆445Updated 6 months ago
- The Autograd Engine☆482Updated last week
- ReFT: Representation Finetuning for Language Models☆1,076Updated 2 weeks ago
- nanoGPT style version of Llama 3.1☆1,162Updated last month
- Automatically evaluate your LLMs in Google Colab☆511Updated 4 months ago
- A comprehensive deep dive into the world of tokens☆212Updated 2 months ago
- ☆856Updated 9 months ago
- A repository for research on medium sized language models.☆469Updated 3 weeks ago