Algorithmic-Alignment-Lab / contractsLinks
Formal Contracts for Multi-Agent Reinforcement Learning
☆19Updated 2 years ago
Alternatives and similar repositories for contracts
Users that are interested in contracts are comparing it to the libraries listed below
Sorting:
- Interpreting how transformers simulate agents performing RL tasks☆90Updated 2 years ago
- ☆144Updated 6 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆134Updated 2 years ago
- Skill Design From AI Feedback☆33Updated 11 months ago
- ☆110Updated last year
- Efficient baselines for autocurricula in JAX.☆206Updated last year
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆33Updated 2 years ago
- An Open-Ended Agentic Simulator☆58Updated last year
- Learn online intrinsic rewards from LLM feedback☆45Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆73Updated last year
- ☆46Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆47Updated 4 years ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆107Updated 2 months ago
- SocialJax: sequential social dilemma environments☆63Updated 2 months ago
- Awesome Open-ended AI☆395Updated 2 weeks ago
- ☆47Updated last year
- ☆77Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness☆57Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆124Updated last year
- ☆60Updated last year
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆51Updated last year
- ☆185Updated 2 years ago
- ☆128Updated 2 years ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆211Updated 2 years ago
- Tools for studying developmental interpretability in neural networks.☆126Updated last month
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- ☆37Updated 2 years ago
- ☆31Updated 3 years ago
- ☆91Updated 2 weeks ago