quantumiracle / Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Benchmarks current methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms include DDPG and PPO.
☆33Updated 3 years ago
Alternatives and similar repositories for Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Users who are interested in Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations are comparing it to the libraries listed below:
- A PyTorch implementation of Multi-Agent Soft Actor-Critic☆43Updated 7 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- There will be updates later☆88Updated 6 years ago
- Assignments for CS294-112.☆30Updated 6 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆90Updated 5 years ago
- Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)☆116Updated 3 years ago
- ☆54Updated 7 years ago
- ppo-lstm-parallel☆49Updated 6 years ago
- ☆62Updated 5 years ago
- ☆123Updated 2 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆41Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆111Updated 3 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆36Updated 4 years ago
- A simple RNN meta-learner☆10Updated 7 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆88Updated 8 years ago
- Using recurrent networks (LSTM) to solve POMDPs☆35Updated 7 years ago
- Single-file PyTorch implementation of hybrid-SAC☆64Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Updated 11 months ago
- ☆20Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆137Updated last year
- ☆40Updated 4 years ago
- ☆48Updated 3 years ago
- ☆49Updated 4 years ago
- ☆49Updated 5 years ago
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆90Updated last year
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆63Updated 7 years ago
- Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆89Updated 7 years ago
- Code for the paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆28Updated 6 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆42Updated 3 years ago
- ☆25Updated 3 years ago