jprivera44 / EscalAItion
Repo for the paper on Escalation Risks of AI systems
☆36 · Updated 9 months ago
Alternatives and similar repositories for EscalAItion:
Users who are interested in EscalAItion compare it to the repositories listed below.
- Demo of using ChatGPT API for language learning ☆12 · Updated last year
- A benchmark for evaluating learning agents based on just language feedback ☆63 · Updated 3 months ago
- ☆48 · Updated 3 months ago
- A text-based game where language models learn to lie and to detect lies. ☆12 · Updated last year
- ☆51 · Updated last week
- ☆14 · Updated 3 months ago
- General-Sum variant of the game Diplomacy for evaluating AIs. ☆29 · Updated 9 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆47 · Updated 7 months ago
- A dataset of alignment research and code to reproduce it ☆73 · Updated last year
- ☆25 · Updated 9 months ago
- An attribution library for LLMs ☆35 · Updated 4 months ago
- Text-based game of lies and deceit, made for language models. ☆30 · Updated last year
- Interpreting how transformers simulate agents performing RL tasks ☆77 · Updated last year
- ☆23 · Updated 7 months ago
- Governance of the Commons Simulation (GovSim) ☆31 · Updated 6 months ago
- ☆79 · Updated last week
- ☆46 · Updated 2 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆117 · Updated last week
- An OpenAI Gym environment to evaluate the ability of LLMs (e.g. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult… ☆64 · Updated last year
- A library for benchmarking the long-term memory and continual learning capabilities of LLM-based agents. With all the tests and code you… ☆63 · Updated last month
- ☆55 · Updated 2 months ago
- ☆18 · Updated 6 months ago
- GitHub repo for storing LlamaDatasets ☆32 · Updated last year
- LLM-powered autonomous agent with hierarchical task management ☆47 · Updated last year
- ☆125 · Updated 2 months ago
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 · Updated last year
- Small, simple agent task environments for training and evaluation ☆18 · Updated 2 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows! ☆46 · Updated last month
- Uses a Gradio interface to stream coding-related responses from local and cloud-based large language models. Pulls context from GitHub Re… ☆19 · Updated 4 months ago
- Measuring the situational awareness of language models ☆33 · Updated 11 months ago