microsoft / autorl-research
A collection of research works on Automatic Reinforcement Learning from Microsoft Research Asia.
☆48Updated last year
Related projects
Alternatives and complementary repositories for autorl-research
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- An unofficial implementation for online decision transformer☆37Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.☆45Updated 8 months ago
- More efficient exploration for reinforcement learning in two-player zero-sum games☆17Updated 3 months ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆71Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆26Updated 5 months ago
- Official code repository for Prompt-DT.☆98Updated 2 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆43Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent☆49Updated last year
- ☆28Updated last year
- Code for FOCAL Paper Published at ICLR 2021☆49Updated 11 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated last year
- [ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments.☆56Updated last year
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆37Updated 3 years ago
- ☆30Updated 3 months ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- ☆86Updated 2 years ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆36Updated 2 years ago
- ☆17Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆46Updated last year
- ☆19Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆25Updated 3 months ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆109Updated this week
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆25Updated last year
- Implementation of the NeurIPS 2021 paper "On Effective Scheduling of Model-based Reinforcement Learning"☆12Updated 3 years ago
- Paper Collection for Batch RL with brief introductions.☆85Updated 2 years ago