Miffyli / rl-action-space-shaping
Experiment code for testing effect of various action space transformations in reinforcement learning
☆30Updated 4 years ago
Alternatives and similar repositories for rl-action-space-shaping:
Users that are interested in rl-action-space-shaping are comparing it to the libraries listed below
- PyTorch IMPALA implementation☆25Updated 5 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 5 years ago
- ☆71Updated 7 months ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆51Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆44Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆38Updated 3 months ago
- ☆21Updated 9 months ago
- ☆54Updated 11 months ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆51Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- ☆18Updated 2 years ago
- Code for demonstration example-task in RUDDER blog☆22Updated 4 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆21Updated 5 years ago
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆39Updated 2 years ago
- ☆81Updated 3 years ago
- ☆35Updated 3 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Implementation of the Option-Critic Architecture☆37Updated 6 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 2 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆31Updated 2 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Updated 5 years ago
- ☆97Updated last year
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆44Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago