alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆42Updated last year
Related projects: ⓘ
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆34Updated 4 years ago
- ☆69Updated 3 months ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 3 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆98Updated 3 years ago
- ☆44Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated last year
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- V-MPO torch version with DMLab30 and GTrXL☆12Updated 3 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆32Updated 5 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆55Updated 3 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆44Updated 5 years ago
- Deep Implicit Coordination Graphs☆40Updated 3 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆61Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆91Updated 2 years ago
- PyTorch IMPALA implementation☆24Updated 5 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆16Updated 4 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Updated 5 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆31Updated 5 years ago
- Soft Actor-Critic with advanced features☆47Updated 3 weeks ago
- ☆43Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆113Updated last month
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆96Updated 2 years ago
- Distributional Soft Actor Critic☆49Updated 4 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆60Updated 5 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 2 years ago
- Implementation of the Option-Critic Architecture☆37Updated 5 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆37Updated 2 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆48Updated 3 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆83Updated last year