jprivera44 / EscalAItion
Repo for the paper on Escalation Risks of AI systems
☆37Updated 11 months ago
Alternatives and similar repositories for EscalAItion:
Users that are interested in EscalAItion are comparing it to the libraries listed below
- ☆18Updated 8 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆66Updated last year
- ☆53Updated 6 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆54Updated last month
- Demo of using ChatGPT API for language learning☆12Updated 2 years ago
- ☆17Updated last month
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆45Updated last month
- Interpreting how transformers simulate agents performing RL tasks☆78Updated last year
- An attribution library for LLMs☆38Updated 6 months ago
- Navigating a maze using LLM agent☆39Updated 2 years ago
- The AgentForge project focuses on building general tooling to construct multicapability AI systems by composing skills and models togethe…☆17Updated last year
- Documentation for dynamic machine learning systems.☆29Updated 6 months ago
- Explainable Reinforcement Learning (XRL) Resources☆38Updated 6 months ago
- ☆130Updated 4 months ago
- General-Sum variant of the game Diplomacy for evaluating AIs.☆28Updated 11 months ago
- Repo to reproduce the First-Explore paper results☆37Updated 3 months ago
- ☆30Updated 9 months ago
- Implementation☆24Updated last week
- ☆15Updated 6 months ago
- Track the progress of LLM context utilisation☆54Updated 8 months ago
- A text-based game where language models learn to lie and to detect lies.☆12Updated last year
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- Machine Learning for Alignment Bootcamp☆25Updated last year
- A strongly typed Python DSL for developing message passing multi agent systems☆52Updated 11 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated 10 months ago
- ☆48Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- we got you bro☆35Updated 8 months ago
- ☆67Updated 2 months ago
- Governance of the Commons Simulation (GovSim)☆44Updated 2 months ago