Language model alignment-focused deep learning curriculum
☆1,537 · Aug 19, 2024 · Updated last year
Alternatives and similar repositories for deep_learning_curriculum
Users who are interested in deep_learning_curriculum compare it to the repositories listed below.
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL. ☆240 · Aug 11, 2025 · Updated 6 months ago
- A library for mechanistic interpretability of GPT-style language models ☆3,133 · Updated this week
- Machine Learning for Alignment Bootcamp ☆82 · Apr 27, 2022 · Updated 3 years ago
- ☆960 · Updated this week
- Mechanistic Interpretability Visualizations using React ☆328 · Dec 18, 2024 · Updated last year
- Solve puzzles. Improve your pytorch. ☆3,966 · Jul 15, 2024 · Updated last year
- What would you do with 1000 H100s... ☆1,155 · Jan 10, 2024 · Updated 2 years ago
- The nnsight package enables interpreting and manipulating the internals of deep learned models. ☆836 · Updated this week
- Decoder only transformer, built from scratch with PyTorch ☆33 · Oct 22, 2023 · Updated 2 years ago
- Machine Learning for Alignment Bootcamp ☆27 · Mar 7, 2024 · Updated last year
- Training Sparse Autoencoders on Language Models ☆1,233 · Feb 27, 2026 · Updated last week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,738 · Jan 8, 2024 · Updated 2 years ago
- A dataset of alignment research and code to reproduce it ☆78 · Jun 22, 2023 · Updated 2 years ago
- Pen and paper exercises in machine learning ☆2,621 · May 21, 2024 · Updated last year
- ☆399 · Aug 21, 2025 · Updated 6 months ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations. ☆217 · Updated this week
- ☆273 · Oct 1, 2024 · Updated last year
- Machine Learning for Alignment Bootcamp (MLAB). ☆31 · Jan 24, 2022 · Updated 4 years ago
- A text-based game where language models learn to lie and to detect lies. ☆12 · Oct 4, 2023 · Updated 2 years ago
- ☆284 · Mar 2, 2024 · Updated 2 years ago
- A playbook for systematically maximizing the performance of deep learning models. ☆29,879 · Jun 18, 2024 · Updated last year
- LLM101n: Let's build a Storyteller ☆36,390 · Aug 1, 2024 · Updated last year
- ☆566 · Jul 11, 2024 · Updated last year
- Measuring the situational awareness of language models ☆40 · Feb 12, 2024 · Updated 2 years ago
- Tools for studying developmental interpretability in neural networks. ☆127 · Dec 30, 2025 · Updated 2 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆247 · Feb 27, 2026 · Updated last week
- ☆20 · Nov 15, 2024 · Updated last year
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆832 · Feb 26, 2026 · Updated last week
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,415 · Feb 20, 2026 · Updated 2 weeks ago
- List of AI Residency Programs ☆3,273 · Apr 4, 2025 · Updated 11 months ago
- 200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science. ☆2,332 · Dec 9, 2025 · Updated 2 months ago
- Tools for understanding how transformer predictions are built layer-by-layer ☆570 · Aug 7, 2025 · Updated 6 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆134 · Feb 15, 2026 · Updated 2 weeks ago
- Sparsify transformers with SAEs and transcoders ☆699 · Updated this week
- Machine Learning Engineering Open Book ☆17,286 · Feb 21, 2026 · Updated 2 weeks ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ☆23,746 · Aug 15, 2024 · Updated last year
- Solve puzzles. Learn CUDA. ☆11,970 · Sep 1, 2024 · Updated last year
- A puzzle to learn about prompting ☆135 · May 12, 2023 · Updated 2 years ago
- Train transformer language models with reinforcement learning. ☆17,523 · Updated this week