for-ai / AI-Alignment-Cohort
☆9 Updated last week
Related projects:
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆115 Updated last year
- Best practices for distilling large language models. ☆371 Updated 7 months ago
- This repository collects all relevant resources about interpretability in LLMs ☆230 Updated last week
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆248 Updated 10 months ago
- System 2 Reasoning Link Collection ☆605 Updated this week
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL. ☆185 Updated 7 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆665 Updated last month
- Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions ☆601 Updated last week
- ☆276 Updated 3 weeks ago
- Slides, notes, and materials for the workshop ☆297 Updated 3 months ago
- ☆274 Updated this week
- Building blocks for foundation models. ☆347 Updated 8 months ago
- Fast bare-bones BPE for modern tokenizer training ☆138 Updated 3 weeks ago
- Puzzles for exploring transformers ☆293 Updated last year
- GPT-2 (124M) quality in 5B tokens ☆227 Updated last week
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes. ☆211 Updated 5 months ago
- LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processin… ☆667 Updated this week
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024 ☆227 Updated 4 months ago
- Website for hosting the Open Foundation Models Cheat Sheet. ☆257 Updated 2 months ago
- Annotated version of the Mamba paper ☆445 Updated 6 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation ☆89 Updated last week
- Code examples and Jupyter notebooks for the Cohere Platform ☆461 Updated this week
- ☆256 Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆493 Updated this week
- LLM Workshop by Sourab Mangrulkar ☆322 Updated 3 months ago
- Tools for understanding how transformer predictions are built layer-by-layer ☆408 Updated 3 months ago
- Graph Machine Learning course, Xavier Bresson, 2023 ☆570 Updated 3 weeks ago
- Sparse autoencoders ☆297 Updated last week
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf ☆195 Updated 5 months ago