A dataset of alignment research and code to reproduce it
☆78Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for alignment-research-dataset
Users that are interested in alignment-research-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- gpt completions in vscode☆35Mar 24, 2023Updated 3 years ago
- Conversational chatbot to answer questions about AI Safety & Alignment based on information retrieved from the Alignment Research Dataset☆15Apr 4, 2026Updated last week
- Experimental LLM interface exploring new ways to use AI to improve human thinking☆19Updated this week
- ☆22Jul 18, 2024Updated last year
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Multiversal tree writing interface for human-AI collaboration☆1,348Jun 28, 2024Updated last year
- Automatically create Anki cards from text using language models☆20Jan 7, 2023Updated 3 years ago
- Project exploring 3D volumetric rendering of NEXRAD radar data.☆12Oct 23, 2023Updated 2 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆217Apr 6, 2026Updated last week
- Indranet Explorer, a simulated browser☆16Nov 12, 2024Updated last year
- Measuring the situational awareness of language models☆40Feb 12, 2024Updated 2 years ago
- botttom-up vr redux☆25Jul 30, 2021Updated 4 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Nov 29, 2023Updated 2 years ago
- Read, write and manipulate code which reads, writes and manipulates code.☆10Mar 15, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆15Jan 12, 2026Updated 3 months ago
- The AI that helps you achieve your goals☆11Feb 4, 2024Updated 2 years ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Jun 3, 2023Updated 2 years ago
- ☆21Aug 18, 2022Updated 3 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- graphpatch is a library for activation patching on PyTorch neural network models.☆21Feb 11, 2025Updated last year
- An analog touch screen joystick that pretends to be a bevy gamepad☆13Jul 13, 2024Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆16Sep 15, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Flight Recorder allows to record client program execution and examine it later☆11Sep 18, 2020Updated 5 years ago
- ☆12Oct 23, 2022Updated 3 years ago
- A small game demonstrating a grid distortion effect☆15Oct 5, 2021Updated 4 years ago
- ☆21Updated this week
- A library for training crosscoders☆16May 28, 2025Updated 10 months ago
- ☆22May 3, 2022Updated 3 years ago
- Machine Learning for Alignment Bootcamp (MLAB).☆33Jan 24, 2022Updated 4 years ago
- Tools for understanding how transformer predictions are built layer-by-layer☆584Aug 7, 2025Updated 8 months ago
- A command line utility for doing polarization simulations☆17Aug 21, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Jun 8, 2023Updated 2 years ago
- see github.com/understanding-search/maze-transformer☆10Dec 8, 2023Updated 2 years ago
- tuimorphic choose-your-own-adventure story game☆18Mar 3, 2026Updated last month
- AI Safety Q&A web frontend☆41Apr 4, 2026Updated last week
- Benchmarking LLM Inference Speeds☆13Apr 7, 2026Updated last week
- ☆65Nov 4, 2021Updated 4 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago