chaovven / PyRL
PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)
☆34Updated 2 years ago
Alternatives and similar repositories for PyRL:
Users that are interested in PyRL are comparing it to the libraries listed below
- ☆73Updated 8 months ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆34Updated 5 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated last month
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆69Updated last year
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 4 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆60Updated 5 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- ☆111Updated last year
- ☆91Updated last year
- ☆26Updated 6 years ago
- PyTorch IMPALA implementation☆25Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆116Updated 3 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"☆45Updated last year
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- FEN Code☆37Updated 5 years ago
- A library of probabilistic model based RL algorithms in pytorch☆108Updated 3 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆173Updated 2 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 5 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- ☆97Updated last year
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- Hierarchical Self-Play☆21Updated 6 years ago