acsresearch / interlab
☆18Updated 7 months ago
Alternatives and similar repositories for interlab:
Users that are interested in interlab are comparing it to the libraries listed below
- A dataset of alignment research and code to reproduce it☆73Updated last year
- General-Sum variant of the game Diplomacy for evaluating AIs.☆29Updated 10 months ago
- Interpreting how transformers simulate agents performing RL tasks☆77Updated last year
- Mechanistic Interpretability for Transformer Models☆49Updated 2 years ago
- METR Task Standard☆142Updated 2 weeks ago
- Machine Learning for Alignment Bootcamp☆70Updated 2 years ago
- Redwood Research's transformer interpretability tools☆14Updated 2 years ago
- ☆83Updated last month
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆78Updated this week
- ☆50Updated 4 months ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 3 years ago
- ☆18Updated 2 years ago
- Tools for running experiments on RL agents in procgen environments☆16Updated 10 months ago
- (Model-written) LLM evals library☆18Updated 2 months ago
- ☆128Updated 3 months ago
- ☆12Updated last year
- Benchmarking Agentic LLM and VLM Reasoning On Games☆115Updated this week
- Repo for the paper on Escalation Risks of AI systems☆36Updated 10 months ago
- ☆61Updated 3 weeks ago
- Machine Learning for Alignment Bootcamp☆25Updated 11 months ago
- we got you bro☆35Updated 6 months ago
- Awesome Open-ended AI☆204Updated 4 months ago
- Language-annotated Abstraction and Reasoning Corpus☆82Updated last year
- ☆11Updated last year
- Machine Learning for Alignment Bootcamp (MLAB).☆25Updated 3 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Updated last year
- A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).☆31Updated 2 years ago
- A collection of different ways to implement accessing and modifying internal model activations for LLMs☆11Updated 4 months ago