jprivera44 / EscalAItion
Repo for the paper on Escalation Risks of AI systems
☆39 Updated last year
Alternatives and similar repositories for EscalAItion
Users who are interested in EscalAItion are comparing it to the repositories listed below
- ☆54 Updated 7 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆31 Updated last week
- Navigating a maze using LLM agent ☆39 Updated 2 years ago
- ☆22 Updated last week
- General-Sum variant of the game Diplomacy for evaluating AIs. ☆29 Updated last year
- An attribution library for LLMs ☆41 Updated 8 months ago
- Extensible Booga AGI ☆16 Updated last year
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main) ☆23 Updated this week
- ☆94 Updated 2 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 Updated last year
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult… ☆68 Updated 2 years ago
- ☆132 Updated 6 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows! ☆65 Updated 2 months ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs" ☆49 Updated 2 months ago
- ☆82 Updated last year
- The AgentForge project focuses on building general tooling to construct multicapability AI systems by composing skills and models togethe… ☆17 Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum. ☆260 Updated 7 months ago
- LLM-powered autonomous agent with hierarchical task management ☆49 Updated 2 years ago
- A benchmark for evaluating learning agents based on just language feedback ☆74 Updated last month
- Measuring the situational awareness of language models ☆34 Updated last year
- Governance of the Commons Simulation (GovSim) ☆47 Updated 4 months ago
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆51 Updated 2 months ago
- ☆11 Updated 9 months ago
- ☆26 Updated 10 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆54 Updated 5 months ago
- ☆30 Updated 10 months ago
- Text-based game of lies and deceit, made for language models. ☆31 Updated last year
- ☆17 Updated 3 months ago
- Repo to reproduce the First-Explore paper results ☆37 Updated 4 months ago