ml-jku / rudder
RUDDER: Return Decomposition for Delayed Rewards
☆47Updated 4 years ago
Alternatives and similar repositories for rudder:
Users that are interested in rudder are comparing it to the libraries listed below
- A practical step-by-step guide to applying RUDDER☆34Updated 5 years ago
- Code for demonstration example-task in RUDDER blog☆22Updated 4 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- Pytorch implementation of distributed deep reinforcement learning☆75Updated 2 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆38Updated 2 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆69Updated last year
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆21Updated 5 years ago
- ☆30Updated 5 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆31Updated 3 weeks ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated 3 weeks ago
- Revisiting Rainbow☆73Updated 3 years ago
- ☆71Updated 7 months ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆16Updated 5 years ago
- Hierarchical Self-Play☆21Updated 6 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)