quantumiracle / Benchmark-Efficient-Reinforcement-Learning-with-DemonstrationsLinks
Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms include DDPG, PPO.
☆31Updated 2 years ago
Alternatives and similar repositories for Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Users that are interested in Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations are comparing it to the libraries listed below
Sorting:
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆109Updated 4 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆40Updated 6 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆88Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 4 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- ☆39Updated 3 years ago
- ReinforcementLearning Learn Play Atari Using DDPG and LSTM.☆20Updated 7 years ago
- ☆60Updated 4 years ago
- ☆52Updated 6 years ago
- Assignments for CS294-112.☆30Updated 5 years ago
- There will be updates later☆85Updated 6 years ago
- ☆40Updated 3 years ago
- ☆31Updated 2 years ago
- ☆20Updated 2 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆40Updated 2 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆31Updated 2 years ago
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 2 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆48Updated 5 years ago
- ☆121Updated 2 years ago
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆41Updated 3 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 6 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- behavior cloning from observation☆35Updated 4 years ago
- A simple RNN meta-learner☆10Updated 6 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- PyTorch implementation of Constrained Policy Optimization☆54Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆64Updated last year
- ☆32Updated 2 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆41Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆55Updated last year