quantumiracle / Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms include DDPG, PPO.
☆29Updated 2 years ago
Alternatives and similar repositories for Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations:
Users that are interested in Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations are comparing it to the libraries listed below
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆38Updated 6 years ago
- ReinforcementLearning Learn Play Atari Using DDPG and LSTM.☆20Updated 7 years ago
- Assignments for CS294-112.☆30Updated 5 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆99Updated 3 years ago
- ☆40Updated 3 years ago
- There will be updates later☆84Updated 5 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆34Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆86Updated 4 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆34Updated 3 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆14Updated 5 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 6 years ago
- ☆26Updated 2 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Updated 2 years ago
- Collection of OpenAI parametrized action-space environments.☆62Updated last year
- ☆120Updated 2 years ago
- ☆53Updated 6 years ago
- ☆33Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated 4 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆31Updated 2 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- ☆94Updated 3 years ago
- ☆19Updated 2 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago
- ☆32Updated 2 years ago
- PyTorch implementation of Deep Reinforcement Algorithm☆30Updated 2 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆32Updated 5 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆42Updated 5 years ago
- behavior cloning from observation☆35Updated 4 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Updated 5 years ago