Algorithmic-Alignment-Lab / contractsLinks
Formal Contracts for Multi-Agent Reinforcement Learning
☆19Updated last year
Alternatives and similar repositories for contracts
Users that are interested in contracts are comparing it to the libraries listed below
Sorting:
- Interpreting how transformers simulate agents performing RL tasks☆87Updated last year
- Skill Design From AI Feedback☆31Updated 6 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆131Updated last year
- ☆138Updated last month
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆68Updated 8 months ago
- Learn online intrinsic rewards from LLM feedback☆43Updated 9 months ago
- ☆101Updated last year
- An Open-Ended Agentic Simulator☆52Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆96Updated 6 months ago
- Efficient baselines for autocurricula in JAX.☆196Updated last year
- General-Sum variant of the game Diplomacy for evaluating AIs.☆29Updated last year
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆16Updated 6 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆191Updated last month
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆30Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆238Updated 2 weeks ago
- SocialJax: sequential social dilemma environments☆45Updated 2 weeks ago
- ☆49Updated last year
- Awesome Open-ended AI☆342Updated last month
- ☆57Updated last year
- Learning Universal Predictors☆79Updated last year
- ☆31Updated 3 years ago
- ☆83Updated 2 weeks ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Can Language Models Solve Olympiad Programming?☆118Updated 8 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 7 months ago
- Simulation Streams is a programming paradigm designed to efficiently control and leverage Large Language Models (LLMs) for complex, dynam…☆23Updated 6 months ago
- ☆127Updated last year
- ☆99Updated 4 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated last year