quantumiracle / Benchmark-Efficient-Reinforcement-Learning-with-DemonstrationsLinks
Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms include DDPG, PPO.
☆32Updated 2 years ago
Alternatives and similar repositories for Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Users that are interested in Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations are comparing it to the libraries listed below
Sorting:
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆43Updated 6 years ago
- There will be updates later☆87Updated 6 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆90Updated 5 years ago
- ☆40Updated 4 years ago
- Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)☆115Updated 3 years ago
- Assignments for CS294-112.☆30Updated 6 years ago
- ☆20Updated 2 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆40Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Updated 9 months ago
- ☆122Updated 2 years ago
- ☆54Updated 6 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆134Updated 9 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆207Updated last year
- Single-file pytorch implementation of hybrid-SAC☆61Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆107Updated 3 years ago
- ☆40Updated 3 years ago
- ReinforcementLearning Learn Play Atari Using DDPG and LSTM.☆20Updated 8 years ago
- ☆48Updated 5 years ago
- Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)☆25Updated 4 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆40Updated 2 years ago
- ☆45Updated 3 years ago
- ☆100Updated 5 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆41Updated 3 years ago
- A simple RNN meta-learner☆10Updated 6 years ago
- ☆49Updated 4 years ago
- ☆106Updated 4 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 7 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆55Updated 4 years ago