moirage/alignment-research-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/moirage/alignment-research-dataset)

moirage / alignment-research-dataset

A dataset of alignment research and code to reproduce it

☆80

Alternatives and similar repositories for alignment-research-dataset

Users that are interested in alignment-research-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

quantified-uncertainty / ai-safety-papers
View on GitHub
☆22Sep 9, 2021Updated 4 years ago
socketteer / worldspider
View on GitHub
gpt completions in vscode
☆35Mar 24, 2023Updated 3 years ago
StampyAI / stampy-chat
View on GitHub
Conversational chatbot to answer questions about AI Safety & Alignment based on information retrieved from the Alignment Research Dataset
☆16Jul 2, 2026Updated 3 weeks ago
acsresearch / interlab
View on GitHub
☆22Jul 18, 2024Updated 2 years ago
Chillee / lit-llama
View on GitHub
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
☆10Aug 29, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
socketteer / loom
View on GitHub
Multiversal tree writing interface for human-AI collaboration
☆1,367Jun 28, 2024Updated 2 years ago
nickkeesG / Pantheon
View on GitHub
Experimental LLM interface exploring new ways to use AI to improve human thinking
☆21Apr 13, 2026Updated 3 months ago
oughtinc / primer
View on GitHub
Factored Cognition Primer: How to write compositional language model programs
☆52Feb 22, 2023Updated 3 years ago
socketteer / hallucinator
View on GitHub
botttom-up vr redux
☆25Jul 30, 2021Updated 4 years ago
danielway / nexrad-volumetric-renderer
View on GitHub
Project exploring 3D volumetric rendering of NEXRAD radar data.
☆13Oct 23, 2023Updated 2 years ago
oughtinc / patchwork
View on GitHub
Command-line recursive question-answering with immutable contexts and explicit data store
☆26Sep 21, 2018Updated 7 years ago
cosmicoptima / indranet-explorer
View on GitHub
Indranet Explorer, a simulated browser
☆16Nov 12, 2024Updated last year
TomFrederik / unseal
View on GitHub
Mechanistic Interpretability for Transformer Models
☆53Jun 1, 2022Updated 4 years ago
AsaCooperStickland / situational-awareness-evals
View on GitHub
Measuring the situational awareness of language models
☆41Feb 12, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
EleutherAI / elk
View on GitHub
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆221Updated this week
EleutherAI / best-download
View on GitHub
URL downloader supporting checkpointing and continuous checksumming.
☆19Nov 29, 2023Updated 2 years ago
johnswentworth / tracelang
View on GitHub
Read, write and manipulate code which reads, writes and manipulates code.
☆11Mar 15, 2020Updated 6 years ago
neelnanda-io / Neuroscope
View on GitHub
Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons
☆14Feb 13, 2023Updated 3 years ago
kronusaturn / lw2-viewer
View on GitHub
An alternative frontend for LessWrong 2.0
☆79Jul 16, 2026Updated last week
StampyAI / stampy
View on GitHub
A Discord bot for the Robert Miles AI server
☆41Jan 27, 2026Updated 5 months ago
evan-lloyd / graphpatch
View on GitHub
graphpatch is a library for activation patching on PyTorch neural network models.
☆21Feb 11, 2025Updated last year
lucidrains / token-shift-gpt
View on GitHub
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49Jan 27, 2022Updated 4 years ago
ArthurConmy / MishformerLens
View on GitHub
MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…
☆10Oct 7, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
johanhelsing / bevy_touch_stick
View on GitHub
An analog touch screen joystick that pretends to be a bevy gamepad
☆13Jul 13, 2024Updated 2 years ago
eburghar / l3charts
View on GitHub
Customizable charts made with TikZ and LaTeX3
☆14Feb 11, 2023Updated 3 years ago
understanding-search / structured-representations-maze-transformers
View on GitHub
see github.com/understanding-search/maze-transformer
☆10Dec 8, 2023Updated 2 years ago
qrdl / flightrec
View on GitHub
Flight Recorder allows to record client program execution and examine it later
☆11Sep 18, 2020Updated 5 years ago
noanabeshima / github-downloader
View on GitHub
Script for downloading GitHub.
☆13Sep 24, 2020Updated 5 years ago
varkor / DISTORT
View on GitHub
A small game demonstrating a grid distortion effect
☆15Oct 5, 2021Updated 4 years ago
kxcloud / gradient-routing
View on GitHub
☆11Dec 4, 2024Updated last year
halcy / tpuddim
View on GitHub
☆22May 3, 2022Updated 4 years ago
danielmamay / mlab
View on GitHub
Machine Learning for Alignment Bootcamp (MLAB).
☆34Jan 24, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AlignmentResearch / tuned-lens
View on GitHub
Tools for understanding how transformer predictions are built layer-by-layer
☆605Aug 7, 2025Updated 11 months ago
TBosak / ability
View on GitHub
Ability is a browser extension that helps people with varying degrees of ability have more control over their browsing experience.
☆22Sep 24, 2025Updated 10 months ago
AranKomat / Diff-DALLE
View on GitHub
☆65Nov 4, 2021Updated 4 years ago
zmitchell / polsim
View on GitHub
A command line utility for doing polarization simulations
☆17Aug 21, 2019Updated 6 years ago
cipher982 / llm-benchmarks
View on GitHub
Benchmarking LLM Inference Speeds
☆14Updated this week
StampyAI / stampy-ui
View on GitHub
AI Safety Q&A web frontend
☆41Apr 4, 2026Updated 3 months ago
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year