dgriff777/rl_a3c_pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dgriff777/rl_a3c_pytorch)

dgriff777 / rl_a3c_pytorch

A3C LSTM Atari with Pytorch plus A3G design

☆570

Alternatives and similar repositories for rl_a3c_pytorch

Users that are interested in rl_a3c_pytorch are comparing it to the libraries listed below

Sorting:

ikostrikov / pytorch-a3c
View on GitHub
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆1,316Sep 25, 2019Updated 6 years ago
dgriff777 / a3c_continuous
View on GitHub
A continuous action space version of A3C LSTM in pytorch plus A3G design
☆260Oct 11, 2024Updated last year
jingweiz / pytorch-rl
View on GitHub
Deep Reinforcement Learning with pytorch & visdom
☆804Jul 16, 2020Updated 5 years ago
ikostrikov / pytorch-a2c-ppo-acktr-gail
View on GitHub
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…
☆3,876May 29, 2022Updated 3 years ago
NVlabs / GA3C
View on GitHub
Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.
☆662Feb 25, 2020Updated 6 years ago
akolishchak / doom-net-pytorch
View on GitHub
Reinforcement learning models in ViZDoom environment
☆130Mar 9, 2022Updated 3 years ago
floringogianu / categorical-dqn
View on GitHub
A working implementation of the Categorical DQN (Distributional RL).
☆95Apr 7, 2018Updated 7 years ago
Kaixhin / ACER
View on GitHub
Actor-critic with experience replay
☆257Oct 9, 2022Updated 3 years ago
ikostrikov / pytorch-ddpg-naf
View on GitHub
Implementation of algorithms for continuous control (DDPG and NAF).
☆313Feb 16, 2021Updated 5 years ago
ShangtongZhang / DeepRL
View on GitHub
Modularized Implementation of Deep RL Algorithms in PyTorch
☆3,412Apr 16, 2024Updated last year
Kaixhin / Rainbow
View on GitHub
Rainbow: Combining Improvements in Deep Reinforcement Learning
☆1,660Jan 13, 2022Updated 4 years ago
greydanus / baby-a3c
View on GitHub
A high-performance Atari A3C agent in 180 lines of PyTorch
☆173Jul 31, 2021Updated 4 years ago
pathak22 / noreward-rl
View on GitHub
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
☆1,471Dec 7, 2022Updated 3 years ago
jingweiz / pytorch-dnc
View on GitHub
Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom
☆278Feb 20, 2018Updated 8 years ago
onlytailei / A3C-PyTorch
View on GitHub
PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch
☆114Apr 3, 2017Updated 8 years ago
miyosuda / async_deep_reinforce
View on GitHub
Asynchronous Methods for Deep Reinforcement Learning
☆591Aug 9, 2018Updated 7 years ago
awjuliani / Meta-RL
View on GitHub
Implementation of Meta-RL A3C algorithm
☆407Feb 22, 2017Updated 9 years ago
kimhc6028 / pytorch-noreward-rl
View on GitHub
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆80Jan 5, 2019Updated 7 years ago
miyosuda / unreal
View on GitHub
Reinforcement learning with unsupervised auxiliary tasks
☆423Feb 13, 2019Updated 7 years ago
steveKapturowski / tensorflow-rl
View on GitHub
Implementations of deep RL papers and random experimentation
☆178Apr 7, 2018Updated 7 years ago
rll / rllab
View on GitHub
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
☆3,045Jun 10, 2023Updated 2 years ago
nsavinov / gym-vizdoom
View on GitHub
Gym wrapper for Vizdoom environments
☆12Dec 14, 2018Updated 7 years ago
andrewliao11 / pytorch-a3c-mujoco
View on GitHub
Implement A3C for Mujoco gym envs
☆73Nov 2, 2017Updated 8 years ago
atgambardella / pytorch-es
View on GitHub
Evolution Strategies in PyTorch
☆354Sep 11, 2017Updated 8 years ago
ypxie / pytorch-NeuCom
View on GitHub
Pytorch implementation of DeepMind's differentiable neural computer paper.
☆93Dec 4, 2017Updated 8 years ago
muupan / async-rl
View on GitHub
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
☆408Feb 25, 2017Updated 9 years ago
Nasdin / ReinforcementLearning-AtariGame
View on GitHub
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…
☆195Sep 19, 2024Updated last year
zuoxingdong / lagom
View on GitHub
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
☆378Nov 19, 2022Updated 3 years ago
Alfredvc / paac
View on GitHub
Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning
☆201Jun 3, 2017Updated 8 years ago
Kaixhin / NoisyNet-A3C
View on GitHub
Noisy Networks for Exploration
☆187Jan 28, 2018Updated 8 years ago
alexis-jacq / Pytorch-DPPO
View on GitHub
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆184Mar 25, 2018Updated 7 years ago
jaromiru / AI-blog
View on GitHub
Accompanying repository for Let's make a DQN / A3C series.
☆394Sep 4, 2018Updated 7 years ago
Kaixhin / Dist-A3C
View on GitHub
Distributed A3C
☆34Dec 22, 2017Updated 8 years ago
zuoxingdong / VIN_PyTorch_Visdom
View on GitHub
PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.
☆225Mar 29, 2017Updated 8 years ago
ikostrikov / pytorch-trpo
View on GitHub
PyTorch implementation of Trust Region Policy Optimization
☆450Sep 13, 2018Updated 7 years ago
jingweiz / pytorch-distributed
View on GitHub
Ape-X DQN & DDPG with pytorch & tensorboard
☆102Jun 18, 2019Updated 6 years ago
higgsfield / RL-Adventure
View on GitHub
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
☆3,163Nov 4, 2021Updated 4 years ago
joschu / modular_rl
View on GitHub
Implementation of TRPO and related algorithms
☆647May 20, 2018Updated 7 years ago
ml-jku / baselines-rudder
View on GitHub
RUDDER for ATARI games with delayed rewards in OpenAI Baselines package
☆268Oct 24, 2019Updated 6 years ago