alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆49Updated 2 years ago
Alternatives and similar repositories for PPO-RND:
Users that are interested in PPO-RND are comparing it to the libraries listed below
- ☆49Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆126Updated 8 months ago
- PyTorch IMPALA implementation☆26Updated 5 years ago
- ☆75Updated 10 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 4 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 2 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Soft Actor-Critic with advanced features☆49Updated this week
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆102Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 2 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- ☆109Updated 2 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆57Updated 4 years ago
- ☆53Updated last year
- Learning Individual Intrinsic Reward in MARL☆62Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆39Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 9 months ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆139Updated 6 years ago
- DecentralizedLearning☆24Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆77Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆84Updated last year
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Distributional Soft Actor Critic☆52Updated 4 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details☆46Updated 3 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆33Updated 5 years ago
- Model-Based Offline Reinforcement Learning☆50Updated 4 years ago