METR / vivariaLinks

Vivaria is METR's tool for running evaluations and conducting agent elicitation research.

☆120

Alternatives and similar repositories for vivaria

Users that are interested in vivaria are comparing it to the libraries listed below

Sorting:

METR / task-standard
METR Task Standard
☆168Updated 10 months ago
poking-agents / modular-public
☆32Updated 6 months ago
METR / RE-Bench
☆119Updated last month
METR / public-tasks
☆108Updated 2 weeks ago
haizelabs / sphynx
Sphynx Hallucination Induction
☆53Updated 10 months ago
google-deepmind / mishax
☆144Updated 2 months ago
TransluceAI / observatory
A toolkit for describing model features and intervening on those features to steer behavior.
☆216Updated last year
google-deepmind / dangerous-capability-evaluations
☆62Updated 2 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆163Updated 7 months ago
haizelabs / verdict
Inference-time scaling for LLMs-as-a-judge.
☆314Updated last month
UKGovernmentBEIS / inspect_evals
Collection of evals for Inspect AI
☆290Updated last week
PrimeIntellect-ai / prime-environments
Training-Ready RL Environments + Evals
☆182Updated this week
emergent-misalignment / emergent-misalignment
☆229Updated this week
UKGovernmentBEIS / control-arena
ControlArena is a collection of settings, model organisms and protocols - for running control experiments.
☆132Updated this week
TransluceAI / docent
☆63Updated 2 months ago
princeton-pli / hal-harness
☆191Updated this week
goodfire-ai / scribe
☆54Updated 2 months ago
haizelabs / Awesome-LLM-Judges
⚖️ Awesome LLM Judges ⚖️
☆134Updated 7 months ago
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆189Updated 8 months ago
leap-laboratories / PIZZA
An attribution library for LLMs
☆46Updated last year
haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆240Updated 9 months ago
anthropics / evals
☆315Updated last year
haizelabs / bijection-learning
☆26Updated last year
anthropics / sleeper-agents-paper
Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".
☆122Updated last year
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆248Updated last year
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆231Updated 11 months ago
LeonGuertler / UnstableBaselines
☆106Updated last month
PrimeIntellect-ai / genesys
☆136Updated 8 months ago
rgreenblatt / arc_draw_more_samples_pub
Draw more samples
☆196Updated last year
EleutherAI / elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆214Updated this week