Collection of evals for Inspect AI
☆512May 25, 2026Updated this week
Alternatives and similar repositories for inspect_evals
Users that are interested in inspect_evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inspect: A framework for large language model evaluations☆2,096May 20, 2026Updated last week
- METR Task Standard☆180Feb 3, 2025Updated last year
- A Kubernetes sandbox environment for use with inspect_ai☆31May 14, 2026Updated last week
- An Inspect extension for agentic cyber evaluations☆29Apr 23, 2026Updated last month
- ☆34Jun 4, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆137May 18, 2026Updated last week
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆195Updated this week
- [ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents☆34Jun 24, 2025Updated 11 months ago
- ☆123Jan 19, 2026Updated 4 months ago
- ☆73May 19, 2026Updated last week
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 4 years ago
- ☆23May 25, 2024Updated 2 years ago
- ☆285May 18, 2026Updated last week
- ☆137Oct 16, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A python sdk for LLM finetuning and inference on runpod infrastructure☆30May 12, 2026Updated 2 weeks ago
- ☆1,096Updated this week
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 6 months ago
- ☆106May 19, 2026Updated last week
- ☆255Apr 22, 2026Updated last month
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆1,200Updated this week
- The fastest way to install llama.cpp☆26Updated this week
- [LREC-Coling 2024] PECC: Problem Extraction and Coding Challenges☆14May 30, 2024Updated last year
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288☆20Oct 30, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The nnsight package enables interpreting and manipulating the internals of deep learned models.☆936Updated this week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆259Feb 27, 2026Updated 2 months ago
- Efficient LLM inference on Slurm clusters.☆101Updated this week
- Work in progress! I don't recommend looking at the code right now.☆24May 18, 2026Updated last week
- Training Sparse Autoencoders on Language Models☆1,389Updated this week
- Open source replication of Anthropic's Crosscoders for Model Diffing☆66Oct 27, 2024Updated last year
- ☆137Feb 10, 2026Updated 3 months ago
- Inference API for many LLMs and other useful tools for empirical research☆122May 12, 2026Updated 2 weeks ago
- Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal" (ICLR 2025)☆82Mar 1, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Playing around with various jailbreaking techniques ahead of the Gray Swan AI Ultimate Jailbreaking Competition☆18Oct 6, 2024Updated last year
- ☆46May 9, 2025Updated last year
- Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning☆22Jul 8, 2024Updated last year
- Repo for the paper on Escalation Risks of AI systems☆44Apr 12, 2024Updated 2 years ago
- A collection of different ways to implement accessing and modifying internal model activations for LLMs☆24Oct 18, 2024Updated last year
- Machine Learning for Alignment Bootcamp☆83Apr 27, 2022Updated 4 years ago
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆566Mar 30, 2026Updated last month