mike-gimelfarb / bayesian-reward-shapingLinks
Bayesian Reward Shaping Framework for Deep Reinforcement Learning
☆23Updated 6 years ago
Alternatives and similar repositories for bayesian-reward-shaping
Users that are interested in bayesian-reward-shaping are comparing it to the libraries listed below
Sorting:
- Soft Actor-Critic with advanced features☆51Updated 2 weeks ago
- Safe Reinforcement Learning algorithms☆75Updated 3 years ago
- Collection of OpenAI parametrized action-space environments.☆66Updated 6 months ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Updated 6 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 5 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆146Updated 3 years ago
- OpenAi's gym environment wrapper to vectorize them with Ray☆23Updated 2 years ago
- ☆78Updated last year
- Deep Reinforcement Learning for Continuous Control in PyTorch☆104Updated 3 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆134Updated last month
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆120Updated 11 months ago
- Combining Evolutionary Algorithms and deep RL in various ways☆105Updated 4 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆48Updated 6 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆66Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Updated 4 years ago
- Emergence of complex strategies through multiagent competition☆44Updated 2 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆38Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 7 months ago
- Solving POMDP using Recurrent networks☆91Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 6 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆27Updated 6 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 4 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆128Updated 4 years ago
- Gym environments modified with adversarial agents☆36Updated 8 years ago
- PyTorch IMPALA implementation☆28Updated 6 years ago
- Experiments with reinforcement learning and recurrent neural networks☆115Updated last year
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 4 years ago