METR Task Standard
☆177Feb 3, 2025Updated last year
Alternatives and similar repositories for task-standard
Users that are interested in task-standard are comparing it to the libraries listed below
Sorting:
- ☆119Jan 19, 2026Updated last month
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆134Feb 15, 2026Updated 2 weeks ago
- ☆33Jun 4, 2025Updated 8 months ago
- Collection of evals for Inspect AI☆380Updated this week
- Inspect: A framework for large language model evaluations☆1,783Updated this week
- Situational Awareness Dataset☆46Dec 14, 2024Updated last year
- A Kubernetes sandbox environment for use with inspect_ai☆27Updated this week
- Einsum with einops style variable names☆18May 16, 2024Updated last year
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- ☆25Nov 11, 2025Updated 3 months ago
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- ☆12Jul 12, 2024Updated last year
- 🔥 A repository for collecting cyberdefense thoughts, books, and documents about AI cyberdefense☆13Jul 2, 2023Updated 2 years ago
- A library for mechanistic interpretability of GPT-style language models☆3,112Updated this week
- ☆944Updated this week
- ☆17Updated this week
- ☆330Jul 2, 2024Updated last year
- anything you want can be built with morph cloud☆27Oct 14, 2025Updated 4 months ago
- we got you bro☆38Jul 29, 2024Updated last year
- A formalisation of Cartesian Frames, a perspective on embedded agency, in the HOL theorem prover.☆20Dec 20, 2021Updated 4 years ago
- ☆45Feb 13, 2026Updated 2 weeks ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆67Feb 12, 2025Updated last year
- ☆65Feb 20, 2026Updated last week
- Machine Learning for Alignment Bootcamp☆82Apr 27, 2022Updated 3 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.☆116Jun 13, 2024Updated last year
- ☆22Sep 9, 2021Updated 4 years ago
- Training Sparse Autoencoders on Language Models☆1,219Updated this week
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆238Aug 11, 2025Updated 6 months ago
- ☆36Jul 4, 2025Updated 7 months ago
- ☆147Jul 23, 2025Updated 7 months ago
- Realtime News and Information Eval☆17Nov 19, 2025Updated 3 months ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 4 years ago
- Take a picture, get a 3D print of it!☆12Dec 26, 2024Updated last year
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆37Apr 21, 2025Updated 10 months ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Mar 20, 2025Updated 11 months ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆217Feb 23, 2026Updated last week
- ☆117Feb 11, 2025Updated last year
- Bindings for the Anthropic API☆11Apr 15, 2025Updated 10 months ago