jsikyoon / V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
☆12Updated 4 years ago
Alternatives and similar repositories for V-MPO_torch:
Users that are interested in V-MPO_torch are comparing it to the libraries listed below
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- ☆14Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- ☆54Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- ☆18Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆51Updated 3 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆33Updated last week
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆12Updated 10 months ago
- ☆17Updated 3 years ago
- ☆55Updated 2 years ago
- ☆41Updated last year
- ☆48Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆19Updated 7 months ago
- Deep Learning Project☆21Updated 5 years ago
- ☆21Updated 10 months ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 2 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆21Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 10 months ago
- ☆39Updated 3 months ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆18Updated 4 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆20Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated 2 years ago
- ☆22Updated 2 years ago