mike-gimelfarb / bayesian-reward-shapingLinks
Bayesian Reward Shaping Framework for Deep Reinforcement Learning
☆23Updated 6 years ago
Alternatives and similar repositories for bayesian-reward-shaping
Users that are interested in bayesian-reward-shaping are comparing it to the libraries listed below
Sorting:
- Soft Actor-Critic with advanced features☆51Updated last week
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Updated 5 years ago
- Collection of OpenAI parametrized action-space environments.☆66Updated 7 months ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 5 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆106Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆135Updated 2 months ago
- ☆78Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆53Updated 5 months ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆88Updated 2 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Updated 6 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆119Updated 11 months ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 2 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆55Updated last year
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆28Updated 5 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆89Updated last year
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 7 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆38Updated 7 months ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆96Updated 3 years ago
- Deep RL agents with PyTorch☆35Updated 4 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- DecentralizedLearning☆25Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆50Updated 2 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 5 years ago
- Implementation of the Option-Critic Architecture☆41Updated 6 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Updated 4 years ago