luchris429 / model-free-opponent-shaping
Code for Model-Free Opponent Shaping (ICML 2022)
☆16Updated 2 years ago
Alternatives and similar repositories for model-free-opponent-shaping:
Users that are interested in model-free-opponent-shaping are comparing it to the libraries listed below
- Scalable Opponent Shaping Experiments in JAX☆24Updated 10 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- Implementation of the Off Belief Learning algorithm.☆45Updated 2 years ago
- ☆41Updated 3 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Learning Laplacian Representations in Reinforcement Learning☆17Updated 4 years ago
- ☆29Updated 2 years ago
- ☆54Updated 11 months ago
- ☆30Updated 5 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- ☆17Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- ☆40Updated 3 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆32Updated 7 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆102Updated 2 years ago
- ☆18Updated 2 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆17Updated 3 months ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆52Updated 3 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- Code for Sibling Rivalry and experiments presented in associated paper☆17Updated 3 years ago
- Code for demonstration example-task in RUDDER blog☆22Updated 4 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆43Updated 7 months ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 4 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆18Updated 2 years ago