cassidylaidlaw / boltzmann-policy-distribution
Code and pretrained models for the ICLR 2022 paper "The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models"
☆10 Updated 2 years ago
Alternatives and similar repositories for boltzmann-policy-distribution
Users who are interested in boltzmann-policy-distribution are comparing it to the libraries listed below.
- Overcooked human-AI experiment platform ☆38 Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆28 Updated 2 years ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method ☆40 Updated 9 months ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning" ☆30 Updated 11 months ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork ☆13 Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆58 Updated 8 months ago
- Code for the NeurIPS 2023 paper "Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning" ☆36 Updated 4 months ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning ☆14 Updated 3 months ago
- Implementation of CoDAIL in the ICLR 2020 paper "Multi-Agent Interactions Modeling with Correlated Policies" ☆18 Updated 4 years ago
- ☆31 Updated 2 years ago
- Code for Model-Free Opponent Shaping (ICML 2022) ☆18 Updated 2 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ☆27 Updated 2 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆27 Updated 3 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆26 Updated 2 years ago
- DecentralizedLearning ☆24 Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆19 Updated last year
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies ☆18 Updated 4 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings ☆15 Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆24 Updated 2 months ago
- Learning diverse options through the Laplacian representation. ☆23 Updated last year
- Scripts to recreate the D4RL datasets with Minari ☆19 Updated last month
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL ☆22 Updated 7 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆29 Updated 6 months ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆30 Updated 3 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch ☆47 Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement… ☆20 Updated 2 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆32 Updated 6 months ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023) ☆18 Updated last year
- Code for the paper "Learning to Assist Humans without Inferring Rewards" ☆15 Updated 11 months ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning ☆20 Updated 2 years ago