jprivera44 / EscalAItion
Repo for the paper on Escalation Risks of AI systems
☆44Updated last year
Alternatives and similar repositories for EscalAItion
Users who are interested in EscalAItion are comparing it to the repositories listed below.
- ☆59Updated 3 weeks ago
- ☆20Updated last year
- An attribution library for LLMs☆43Updated last year
- ☆104Updated this week
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆55Updated 7 months ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆57Updated 10 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆127Updated last year
- A repo to evaluate various LLMs' chess-playing abilities.☆82Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆78Updated 10 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 8 months ago
- ☆86Updated last year
- ☆98Updated 2 months ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆24Updated 2 months ago
- ☆137Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆118Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆120Updated 6 months ago
- ☆75Updated 6 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 7 months ago
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆94Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆296Updated 3 months ago
- Governance of the Commons Simulation (GovSim)☆59Updated 9 months ago
- ☆138Updated last week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆89Updated 2 weeks ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆187Updated 7 months ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆269Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆240Updated last month
- Automated Capability Discovery via Foundation Model Self-Exploration☆64Updated 8 months ago
- Automating enterprise workflows with multimodal agents☆112Updated last year
- A dataset of alignment research and code to reproduce it☆77Updated 2 years ago