☆142Oct 16, 2025Updated 8 months ago
Alternatives and similar repositories for RE-Bench
Users that are interested in RE-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- METR Task Standard☆181Feb 3, 2025Updated last year
- ☆126Jun 10, 2026Updated 3 weeks ago
- ☆23Oct 15, 2022Updated 3 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆144May 6, 2026Updated last month
- Work in progress! I don't recommend looking at the code right now.☆23May 29, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ☆13May 7, 2023Updated 3 years ago
- ☆13Dec 8, 2022Updated 3 years ago
- A python sdk for LLM finetuning and inference on runpod infrastructure☆30May 12, 2026Updated last month
- Machine Learning for Alignment Bootcamp (MLAB).☆34Jan 24, 2022Updated 4 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆14Feb 13, 2023Updated 3 years ago
- ☆49May 17, 2026Updated last month
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆220Jun 22, 2026Updated last week
- Inspect: A framework for large language model evaluations☆2,251Jun 25, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- ☆343Jun 19, 2024Updated 2 years ago
- ☆1,146Jun 22, 2026Updated last week
- Mamba support for transformer lens☆20Sep 17, 2024Updated last year
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.☆118Jun 13, 2024Updated 2 years ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆1,010Aug 14, 2024Updated last year
- Measuring the situational awareness of language models☆41Feb 12, 2024Updated 2 years ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆25Feb 26, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆305Jun 23, 2026Updated last week
- ☆20Feb 17, 2023Updated 3 years ago
- ☆26Jun 22, 2025Updated last year
- Official repo for the paper "Make Some Noise: Reliable and Efficient Single-Step Adversarial Training" (https://arxiv.org/abs/2202.01181)☆25Oct 17, 2022Updated 3 years ago
- Tools for running experiments on RL agents in procgen environments☆20Apr 5, 2024Updated 2 years ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆28Mar 4, 2025Updated last year
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆14Sep 4, 2024Updated last year
- ☆424Aug 21, 2025Updated 10 months ago
- ☆27Apr 1, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Collection of evals for Inspect AI☆561Updated this week
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆47May 31, 2024Updated 2 years ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- Code for Voice Jailbreak Attacks Against GPT-4o.☆38May 31, 2024Updated 2 years ago
- ☆12Aug 21, 2024Updated last year
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated last year
- Benchmarking Goal-Oriented Software Engineering☆175Updated this week