Algorithmic-Alignment-Lab / contractsLinks
Formal Contracts for Multi-Agent Reinforcement Learning
☆19Updated last year
Alternatives and similar repositories for contracts
Users that are interested in contracts are comparing it to the libraries listed below
Sorting:
- Skill Design From AI Feedback☆31Updated 7 months ago
- Interpreting how transformers simulate agents performing RL tasks☆88Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆131Updated last year
- ☆137Updated 2 months ago
- Efficient baselines for autocurricula in JAX.☆196Updated last year
- ☆45Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆68Updated 9 months ago
- An Open-Ended Agentic Simulator☆52Updated last year
- Learn online intrinsic rewards from LLM feedback☆43Updated 10 months ago
- ☆31Updated 3 years ago
- ☆48Updated last year
- SocialJax: sequential social dilemma environments☆47Updated 2 weeks ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆98Updated 2 weeks ago
- ☆16Updated last week
- ☆103Updated last year
- General-Sum variant of the game Diplomacy for evaluating AIs.☆31Updated last year
- Object Centric Atari games☆91Updated this week
- ☆12Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆240Updated last month
- A domain-specific probabilistic programming language for modeling and inference with language models☆136Updated 5 months ago
- Awesome Open-ended AI☆348Updated 2 months ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆116Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆55Updated 2 years ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆57Updated 8 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆32Updated last year
- Redwood Research's transformer interpretability tools☆14Updated 3 years ago
- ☆74Updated last year
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- ☆128Updated last year