Howuhh / evolution_strategies_openai
implementation of "Evolution Strategies as a Scalable Alternative to Reinforcement Learning" OpenAI paper
☆17Updated 3 years ago
Related projects: ⓘ
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆43Updated 6 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated last year
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆21Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆42Updated last year
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 2 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆30Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆45Updated 3 months ago
- Multi-Agent Determinantal Q-Learning☆41Updated last year
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆34Updated 5 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆32Updated 3 years ago
- Deep Implicit Coordination Graphs☆40Updated 3 months ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆48Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- ☆10Updated this week
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Updated 4 years ago
- ☆69Updated 3 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆37Updated 2 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆20Updated last year
- The Starcraft Multi-Agent challenge lite☆32Updated last week
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆34Updated 4 years ago
- GAIL learning to imitate PPO playing CartPole.☆12Updated 3 years ago
- Distributed Deep Reinforcement Learning☆29Updated 3 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆19Updated last year
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆46Updated last year
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆34Updated 2 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆24Updated 3 months ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆21Updated last year
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆24Updated 2 years ago