microsoft / autorl-research
The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.
☆50Updated last year
Alternatives and similar repositories for autorl-research:
Users that are interested in autorl-research are comparing it to the libraries listed below
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆72Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆29Updated 3 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated last year
- ☆88Updated 2 years ago
- ☆108Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆51Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆81Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆18Updated 10 months ago
- An unofficial implementation for online decision transformer☆39Updated 2 years ago
- Official code repository for Prompt-DT.☆102Updated 2 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆45Updated 2 years ago
- Paper Collection for Batch RL with brief introductions.☆85Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆69Updated 2 weeks ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆32Updated this week
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆45Updated 11 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆51Updated last year
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆12Updated 3 years ago
- Conservative Q Learning on top of SAC☆123Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- ☆123Updated 6 months ago
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 4 years ago