grantsrb / PyTorch-A2C

General implementation of Advantage Actor Critic using Pytorch

☆28

Alternatives and similar repositories for PyTorch-A2C

Users who are interested in PyTorch-A2C are comparing it to the repositories listed below.

jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆139 Updated 2 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96 Updated 3 years ago
rraileanu / auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆102 Updated 2 years ago
lnpalmer / A2C
PyTorch implementation of Advantage Actor-Critic (A2C)
☆46 Updated 7 years ago
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆54 Updated 4 months ago
xlnwel / model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x
☆62 Updated 4 years ago
ASzot / ppo-pytorch
Proximal policy optimization in PyTorch. Easy to read and understand.
☆50 Updated 4 years ago
jcwleo / random-network-distillation-pytorch
Random Network Distillation pytorch
☆251 Updated 6 years ago
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆48 Updated 3 years ago
cyoon1729 / distributedRL
A framework for easy prototyping of distributed reinforcement learning algorithms
☆96 Updated 4 years ago
bsivanantham / GAE
Reinforcement learning algorithms with Generalized Advantage Estimation
☆21 Updated 7 years ago
rmst / rtrl
PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)
☆74 Updated 5 years ago
tesatory / hsp
Hierarchical Self-Play
☆21 Updated 6 years ago
BY571 / Munchausen-RL
PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆45 Updated 4 years ago
createamind / DRL
☆92 Updated 4 years ago
sadeqa / Super-Mario-Bros-RL
This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…
☆79 Updated 6 years ago
wizdom13 / RND-Pytorch
Random Network Distillation (RND) algo in Pytorch
☆50 Updated 6 years ago
microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆69 Updated last year
nikhilbarhate99 / Deterministic-GAIL-PyTorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
☆67 Updated 5 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71 Updated 8 years ago
jcwleo / mario_rl
☆69 Updated 6 years ago
cjm715 / mgym
A collection of multi-agent reinforcement learning OpenAI gym environments
☆45 Updated 5 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58 Updated 6 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆88 Updated 4 years ago
intel / cerl
☆72 Updated 2 years ago
lcswillems / torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
☆205 Updated 2 years ago
dongminlee94 / Reinforcement-Learning-Code
A repository for code of reinforcement learning algorithms with PyTorch
☆30 Updated 3 years ago
AIcrowd / neurips2020-procgen-starter-kit
Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd
☆91 Updated 2 years ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175 Updated 2 years ago
neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆76 Updated 3 years ago