A curated list of awesome resources for Artificial Intelligence Alignment research
☆80 · Jul 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for awesome-ai-alignment
Users who are interested in awesome-ai-alignment are comparing it to the repositories listed below
- Machine Learning for Alignment Bootcamp ☆82 · Apr 27, 2022 · Updated 3 years ago
- An Obsidian starter kit for LessWrong, Effective Altruism, AI Alignment, etc. ☆14 · Nov 12, 2022 · Updated 3 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper. ☆16 · Nov 21, 2025 · Updated 3 months ago
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022 ☆11 · Aug 9, 2022 · Updated 3 years ago
- lanmt ebm ☆12 · Jun 19, 2020 · Updated 5 years ago
- Awesome Forums ☆36 · Jun 24, 2020 · Updated 5 years ago
- ☆13 · Aug 9, 2023 · Updated 2 years ago
- A list of awesome and useful websites that should be in your bookmarks. 👾🤖 ☆16 · Aug 10, 2019 · Updated 6 years ago
- Awesome Firefox Extensions ☆64 · May 19, 2022 · Updated 3 years ago
- Boilerplate for creating awesome repositories. Just fork and be awesome! ☆15 · Dec 6, 2015 · Updated 10 years ago
- ☆14 · Mar 31, 2024 · Updated last year
- Website for PauseAI.info ☆24 · Updated this week
- A Collection of Awesome Personal Search Engines and Related Projects ☆20 · Jan 10, 2023 · Updated 3 years ago
- A curated list of awesome books, films and wherever about time travel. ☆13 · Oct 1, 2019 · Updated 6 years ago
- Interpretability dashboard for reinforcement learners ☆16 · Jun 4, 2019 · Updated 6 years ago
- Models for data stocks and training dataset sizes ☆18 · Jul 10, 2024 · Updated last year
- A curated list of resources, data, tools, scholarship related to Open Access, Data and Open Science ☆49 · Sep 27, 2023 · Updated 2 years ago
- ☆19 · Mar 5, 2024 · Updated 2 years ago
- A curated list of awesome e-commerce resources ☆43 · Dec 7, 2022 · Updated 3 years ago
- Collection of linux sysadmin/devop interview questions ☆18 · Aug 21, 2015 · Updated 10 years ago
- (Model-written) LLM evals library ☆18 · Dec 13, 2024 · Updated last year
- Awesome GPT-4 with Applications. This is a collection of resources related to GPT-4, including news, official documents, demo and applica… ☆20 · Mar 15, 2023 · Updated 2 years ago
- A community-curated repository of 🔥 learning resources ☆95 · Jan 18, 2018 · Updated 8 years ago
- ☆23 · Jan 7, 2025 · Updated last year
- My personal list of electronics resources for DIY ☆63 · Nov 25, 2023 · Updated 2 years ago
- An awesome list of awesome open access projects ☆24 · Jan 22, 2019 · Updated 7 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles". ☆24 · Mar 25, 2025 · Updated 11 months ago
- Awesome Podcasts ☆93 · Apr 7, 2023 · Updated 2 years ago
- A collection of different ways to implement accessing and modifying internal model activations for LLMs ☆20 · Oct 18, 2024 · Updated last year
- Like all persons of the Library, I have traveled in my youth; I have wandered in search of a book, perhaps the catalogue of catalogues...… ☆151 · Nov 6, 2022 · Updated 3 years ago
- Official repo for the paper "Make Some Noise: Reliable and Efficient Single-Step Adversarial Training" (https://arxiv.org/abs/2202.01181) ☆25 · Oct 17, 2022 · Updated 3 years ago
- ☆24 · Jan 28, 2025 · Updated last year
- Training (hopefully) safe agents in gridworlds ☆25 · May 12, 2019 · Updated 6 years ago
- Tools for studying developmental interpretability in neural networks. ☆127 · Dec 30, 2025 · Updated 2 months ago
- ☆27 · Oct 6, 2024 · Updated last year
- Awesome Telegram Groups ☆66 · Nov 19, 2022 · Updated 3 years ago
- Articles related to technical leadership or career growth ☆27 · Apr 11, 2022 · Updated 3 years ago
- ☆31 · Nov 7, 2024 · Updated last year
- Code repo for the model organisms and convergent directions of EM papers. ☆53 · Sep 22, 2025 · Updated 5 months ago