jprivera44 / EscalAItionLinks
Repo for the paper on Escalation Risks of AI systems
☆44Updated last year
Alternatives and similar repositories for EscalAItion
Users that are interested in EscalAItion are comparing it to the libraries listed below
Sorting:
- Public repository containing METR's DVC pipeline for eval data analysis☆138Updated 7 months ago
- ☆142Updated 4 months ago
- ☆62Updated 2 months ago
- ☆108Updated last week
- General-Sum variant of the game Diplomacy for evaluating AIs.☆32Updated last year
- How to create rational LLM-based agents? Using game-theoretic workflows!☆84Updated 5 months ago
- ☆221Updated 2 years ago
- Problem solving by engaging multiple AI agents in conversation with each other and the user.☆230Updated last year
- ☆79Updated last year
- Governance of the Commons Simulation (GovSim)☆61Updated 10 months ago
- ☆53Updated last year
- A virtual environment for developing and evaluating automated scientific discovery agents.☆191Updated 8 months ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆269Updated last year
- ☆75Updated 7 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 9 months ago
- Sphynx Hallucination Induction☆53Updated 10 months ago
- ☆104Updated 4 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆307Updated 4 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆131Updated last year
- An attribution library for LLMs☆46Updated last year
- A benchmark for evaluating learning agents based on just language feedback☆92Updated 5 months ago
- Interaction-first method for generating demonstrations for web-agents on any website☆51Updated 7 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆71Updated 2 years ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆48Updated last year
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆214Updated last week
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆26Updated 4 months ago
- Specification for creating reliable LLM-based conversational agents☆64Updated last month
- Automated Capability Discovery via Foundation Model Self-Exploration☆65Updated 9 months ago
- A repo to evaluate various LLM's chess playing abilities.☆85Updated last year