grantsrb / PyTorch-A2C
General implementation of Advantage Actor Critic using Pytorch
☆27Updated 3 years ago
Alternatives and similar repositories for PyTorch-A2C:
Users that are interested in PyTorch-A2C are comparing it to the libraries listed below
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆102Updated 2 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆137Updated 2 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆93Updated 4 years ago
- Hierarchical Self-Play☆21Updated 6 years ago
- A curated list of reinforcement learning environments and frameworks.☆50Updated 6 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆37Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆45Updated 7 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Deep RL agents with PyTorch☆35Updated 3 years ago
- Random Network Distillation(RND) algo in Pytorch☆49Updated 6 years ago
- Machine Learning Course Project Skoltech 2018☆108Updated 6 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆49Updated 4 years ago
- ☆82Updated 3 years ago
- PyTorch implementation of CommNet☆36Updated 7 years ago
- ☆69Updated 6 years ago
- Meta Reinforcement Learning Experiments☆34Updated 7 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆70Updated last year
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 2 weeks ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆25Updated 6 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 4 years ago
- Efficient Exploration via State Marginal Matching (2019)☆67Updated 5 years ago