LilTwo / DRL-using-PyTorch
PyTorch implementation of Deep Reinforcement Algorithm
☆30Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for DRL-using-PyTorch
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- ☆83Updated 5 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆132Updated 3 months ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…☆27Updated last year
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆70Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆96Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated last year
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆55Updated 2 years ago
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆37Updated 2 years ago
- ☆13Updated 5 years ago
- ☆119Updated last year
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆82Updated 4 years ago
- DSAC; Distributional Soft Actor-Critic☆114Updated 9 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- Distributional Soft Actor Critic☆49Updated 4 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆93Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆98Updated 4 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆53Updated last year
- behavior cloning from observation☆35Updated 3 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆33Updated 3 years ago
- Soft Actor-Critic with advanced features☆47Updated last month
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆99Updated 4 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆26Updated 2 years ago
- ☆47Updated 5 years ago