mike-gimelfarb / bayesian-reward-shaping
Bayesian Reward Shaping Framework for Deep Reinforcement Learning
☆21 · Updated 5 years ago
Related projects:
- A curated list of awesome Model-based reinforcement learning resources ☆88 · Updated 4 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆25 · Updated 5 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆63 · Updated last year
- ☆69 · Updated 3 months ago
- OpenAI Gym environment for Robot Soccer Goal ☆17 · Updated 5 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆75 · Updated 9 months ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆48 · Updated 5 years ago
- Implementation of the Option-Critic Architecture ☆37 · Updated 5 years ago
- Safe Reinforcement Learning algorithms ☆69 · Updated 2 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆71 · Updated 7 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch. ☆51 · Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆54 · Updated 5 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture ☆18 · Updated 5 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial. ☆36 · Updated last year
- Safe Reinforcement Learning in Constrained Markov Decision Processes ☆53 · Updated 4 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆44 · Updated 4 years ago
- ☆95 · Updated last year
- Hierarchical Self-Play ☆21 · Updated 5 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆42 · Updated last year
- A Multi-agent Learning Framework ☆61 · Updated 3 years ago
- Hierarchical Online Planning and Reinforcement Learning on Taxi ☆30 · Updated 6 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 · Updated 3 years ago
- Soft Actor-Critic with advanced features ☆47 · Updated 3 weeks ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019) ☆33 · Updated 4 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms ☆48 · Updated 3 years ago
- Distributional Soft Actor Critic ☆49 · Updated 4 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆53 · Updated last year
- Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym. ☆88 · Updated 5 years ago
- Emergence of complex strategies through multiagent competition ☆40 · Updated last year
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019) ☆65 · Updated 4 years ago