A dataset of alignment research and code to reproduce it
☆78Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for alignment-research-dataset
Users that are interested in alignment-research-dataset are comparing it to the libraries listed below
Sorting:
- gpt completions in vscode☆35Mar 24, 2023Updated 2 years ago
- ☆22Sep 9, 2021Updated 4 years ago
- Experimental LLM interface exploring new ways to use AI to improve human thinking☆19Updated this week
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- Automatically create Anki cards from text using language models☆20Jan 7, 2023Updated 3 years ago
- Multiversal tree writing interface for human-AI collaboration☆1,342Jun 28, 2024Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆16Sep 15, 2023Updated 2 years ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- ☆22Jul 18, 2024Updated last year
- Factored Cognition Primer: How to write compositional language model programs☆52Feb 22, 2023Updated 3 years ago
- ☆44Dec 28, 2022Updated 3 years ago
- Read, write and manipulate code which reads, writes and manipulates code.☆10Mar 15, 2020Updated 5 years ago
- A distributed network based on hash codes and lattices.☆14Aug 16, 2016Updated 9 years ago
- The AI that helps you achieve your goals☆11Feb 4, 2024Updated 2 years ago
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations.☆13Jan 6, 2023Updated 3 years ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- An analog touch screen joystick that pretends to be a bevy gamepad☆13Jul 13, 2024Updated last year
- Customizable charts made with TikZ and LaTeX3☆14Feb 11, 2023Updated 3 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- Command-line recursive question-answering with immutable contexts and explicit data store☆26Sep 21, 2018Updated 7 years ago
- Stampy's copy of Alignment Research Dataset scraper☆13Dec 26, 2025Updated 2 months ago
- Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"☆10Dec 17, 2023Updated 2 years ago
- tuimorphic choose-your-own-adventure story game☆16Jan 19, 2026Updated last month
- Automated terminal emulator benchmarks☆22Updated this week
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- ☆12Oct 23, 2022Updated 3 years ago
- Sparse Autoencoder Training Library☆55May 1, 2025Updated 10 months ago
- AI Safety Q&A web frontend☆41Updated this week
- General-Sum variant of the game Diplomacy for evaluating AIs.☆34Apr 2, 2024Updated last year
- Machine Learning for Alignment Bootcamp (MLAB).☆31Jan 24, 2022Updated 4 years ago
- Tools for understanding how transformer predictions are built layer-by-layer☆570Aug 7, 2025Updated 6 months ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆17Jun 29, 2025Updated 8 months ago
- A script for immunizing a google account for the effects of 13 September which will break some Google Drive Links☆14Sep 13, 2021Updated 4 years ago
- ☆21Feb 20, 2026Updated last week
- Measuring the situational awareness of language models☆40Feb 12, 2024Updated 2 years ago
- A Loom implementation in Obsidian☆326Mar 20, 2025Updated 11 months ago
- Interactive Composition Explorer: a debugger for compositional language model programs☆567Jan 5, 2026Updated 2 months ago
- ☆32May 23, 2023Updated 2 years ago
- A small game demonstrating a grid distortion effect☆15Oct 5, 2021Updated 4 years ago