☆73 · Jun 5, 2026 · Updated last week
Alternatives and similar repositories for dangerous-capability-evaluations
Users that are interested in dangerous-capability-evaluations are comparing it to the libraries listed below.
- An Inspect extension for agentic cyber evaluations ☆29 · May 28, 2026 · Updated 2 weeks ago
- micro-gpt in ASM on the Super Nintendo ☆67 · Feb 12, 2026 · Updated 4 months ago
- ☆124 · Jan 19, 2026 · Updated 4 months ago
- ☆18 · May 6, 2023 · Updated 3 years ago
- Python3 library for sophisticated timing attacks using Gaussian Mixture Model. ☆22 · Apr 10, 2022 · Updated 4 years ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆138 · May 18, 2026 · Updated 3 weeks ago
- A spoken version of the textual story cloze benchmark ☆22 · Aug 6, 2023 · Updated 2 years ago
- Tools for running experiments on RL agents in procgen environments ☆20 · Apr 5, 2024 · Updated 2 years ago
- METR Task Standard ☆181 · Feb 3, 2025 · Updated last year
- ☆66 · Sep 13, 2025 · Updated 9 months ago
- GeoGraph provides a tool for analysing habitat fragmentation and related problems in landscape ecology. GeoGraph builds a geospatially re… ☆41 · Apr 12, 2024 · Updated 2 years ago
- Remote code execution in Power Platform connectors via JSON deserialization ☆23 · Mar 30, 2023 · Updated 3 years ago
- Make inso available in your GitHub Actions workflows ☆11 · Jul 16, 2025 · Updated 10 months ago
- ☆11 · Nov 27, 2019 · Updated 6 years ago
- CompChomper is a framework for measuring how LLMs perform at code completion. ☆21 · Apr 29, 2025 · Updated last year
- Official repository for CVPR'23 paper: Detecting Backdoors in Pre-trained Encoders ☆38 · Sep 25, 2023 · Updated 2 years ago
- Explore, Establish, Exploit: Red Teaming Language Models from Scratch ☆15 · Jun 21, 2023 · Updated 2 years ago
- Examples for learning assembly language ☆10 · Aug 5, 2021 · Updated 4 years ago
- A formalisation of Cartesian Frames, a perspective on embedded agency, in the HOL theorem prover. ☆22 · Dec 20, 2021 · Updated 4 years ago
- Small, simple agent task environments for training and evaluation ☆20 · Nov 1, 2024 · Updated last year
- ☆15 · Sep 22, 2023 · Updated 2 years ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models ☆11 · Jan 23, 2024 · Updated 2 years ago
- ☆36 · Mar 6, 2026 · Updated 3 months ago
- Testing LLMs reflection and planning capabilities with gym environments ☆14 · Aug 30, 2024 · Updated last year
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments. ☆201 · Updated this week
- A system to improve compatibility between different Django versions, and make upgrading dependencies less painful. ☆13 · Apr 13, 2026 · Updated 2 months ago
- Make it easy to automatically and uniformly measure the behavior of many AI Systems. ☆27 · Oct 2, 2024 · Updated last year
- Reinforcement Learning Replications is a set of Pytorch implementations of reinforcement learning algorithms. ☆24 · Apr 4, 2026 · Updated 2 months ago
- Central repo for talks and presentations ☆48 · Jul 23, 2024 · Updated last year
- Collection of evals for Inspect AI ☆529 · Updated this week
- I am still working on it ☆11 · Apr 30, 2020 · Updated 6 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations. ☆220 · Jun 8, 2026 · Updated last week
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening ☆10 · Dec 18, 2025 · Updated 5 months ago
- ☆10 · Nov 15, 2023 · Updated 2 years ago
- ☆15 · Mar 7, 2025 · Updated last year
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network" ☆29 · Jun 4, 2024 · Updated 2 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024) ☆14 · Oct 3, 2024 · Updated last year
- ☆151 · Jan 17, 2025 · Updated last year
- An application that displays a map and graphs showing solar irradiance forecasts in solar farms in Georgia using data from the National S… ☆10 · Oct 15, 2021 · Updated 4 years ago