Algorithmic-Alignment-Lab / contracts
Formal Contracts for Multi-Agent Reinforcement Learning
☆18 Updated last year
Alternatives and similar repositories for contracts
Users who are interested in contracts compare it to the libraries listed below.
- Interpreting how transformers simulate agents performing RL tasks ☆87 Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback ☆129 Updated last year
- ☆137 Updated 8 months ago
- Skill Design From AI Feedback ☆30 Updated 4 months ago
- Efficient baselines for autocurricula in JAX. ☆189 Updated 10 months ago
- ☆98 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 Updated 10 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆91 Updated 4 months ago
- ☆82 Updated 3 months ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers ☆53 Updated 2 years ago
- ☆56 Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆173 Updated 3 months ago
- An Open-Ended Agentic Simulator ☆51 Updated 11 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆30 Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆73 Updated 11 months ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning" ☆208 Updated last year
- ☆31 Updated 2 years ago
- SocialJax: sequential social dilemma environments ☆41 Updated last month
- Scaling scaling laws with board games. ☆49 Updated 2 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research ☆24 Updated this week
- Code and links for over 25,000 trained Atari agents ☆96 Updated 10 months ago
- Object Centric Atari games ☆85 Updated last month
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs). ☆234 Updated 8 months ago
- Learn online intrinsic rewards from LLM feedback ☆41 Updated 7 months ago
- Nethack Learning Environment Wrapper for Language Interface ☆38 Updated last year
- Drop-in environment replacements that make your RL algorithm train faster. ☆21 Updated last year
- ☆11 Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆11 Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE) ☆29 Updated last year
- ☆143 Updated last year