jidiai / SummerCourse2021
☆25Updated 3 years ago
Alternatives and similar repositories for SummerCourse2021:
Users who are interested in SummerCourse2021 are comparing it to the repositories listed below
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆29Updated 3 years ago
- ☆162Updated last year
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- ☆42Updated last week
- Implementations of many sparse-reward algorithms in the Gym Fetch environment☆85Updated 4 years ago
- Implementations of all kinds of DQN reinforcement learning algorithms in PyTorch☆94Updated 3 years ago
- ☆122Updated 3 years ago
- Assignments for CS294-112.☆30Updated 5 years ago
- ☆97Updated 4 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated 2 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆21Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- Paper list for constrained policy optimization in reinforcement learning.☆71Updated last year
- Re-implementations of SOTA RL algorithms.☆129Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆46Updated 2 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆73Updated 2 years ago
- ☆17Updated 2 years ago
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆21Updated 3 years ago
- A pack of reinforcement learning algorithms.☆83Updated 3 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆117Updated 4 months ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆69Updated 2 years ago
- ☆28Updated 3 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- ☆127Updated 7 months ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 2 years ago
- ☆38Updated 2 years ago
- PyTorch implementation of First Order Constrained Optimization in Policy Space (FOCOPS).☆26Updated 3 years ago