MG2033/A2C

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MG2033/A2C)

MG2033 / A2C

A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow

☆182

Alternatives and similar repositories for A2C

Users that are interested in A2C are comparing it to the libraries listed below

Sorting:

MahanFathi / TRPO-TensorFlow
View on GitHub
Trust Region Policy Optimization (TRPO) in pure TensorFlow
☆18Jun 7, 2018Updated 7 years ago
gd-zhang / ACKTR
View on GitHub
Actor Critic using Kronecker-Factored Trust Region
☆19Jul 3, 2018Updated 7 years ago
MethodsOfMachineLearning / probabilistic_line_search
View on GitHub
Probabilistic line search algorithm for stochastic optimization with a TensorFlow interface.
☆22Jul 27, 2017Updated 8 years ago
rgilman33 / baselines-A2C
View on GitHub
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
☆52Feb 4, 2020Updated 6 years ago
ischlag / tensorflow-input-pipelines
View on GitHub
TensorFlow input pipelines for multiple datasets for easy data fetching
☆54Dec 19, 2016Updated 9 years ago
ikostrikov / pytorch-a2c-ppo-acktr-gail
View on GitHub
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…
☆3,875May 29, 2022Updated 3 years ago
KokoMind / Recurrent-Environment-Simulators
View on GitHub
Deepmind Recurrent Environment Simulators paper implementation in tensorflow
☆74Feb 2, 2018Updated 8 years ago
openai / baselines-results
View on GitHub
☆119Jul 9, 2020Updated 5 years ago
miyosuda / async_deep_reinforce
View on GitHub
Asynchronous Methods for Deep Reinforcement Learning
☆591Aug 9, 2018Updated 7 years ago
ikostrikov / pytorch-a3c
View on GitHub
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆1,316Sep 25, 2019Updated 6 years ago
davide97l / rl-traingenerator
View on GitHub
Automatic code generator for training Reinforcement Learning policies
☆11Jan 3, 2021Updated 5 years ago
david-abel / state_abstraction
View on GitHub
Code for abstracting, evaluating, and visualizing Markov Decision Processes.
☆10Jan 12, 2017Updated 9 years ago
fanshiliang / Hierarchical-Deep-Reinforcement-Learning
View on GitHub
paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation
☆10Mar 27, 2018Updated 7 years ago
BorealisAI / pommerman-baseline
View on GitHub
Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"
☆37May 9, 2019Updated 6 years ago
TianhongDai / reinforcement-learning-algorithms
View on GitHub
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…
☆692Dec 18, 2025Updated 2 months ago
simoninithomas / ml-agents-snowball-fight
View on GitHub
A multi-agent environment using Unity ML-Agents Toolkit
☆10Dec 9, 2020Updated 5 years ago
csc2541-f17 / csc2541-f17.github.io
View on GitHub
☆12Dec 7, 2017Updated 8 years ago
openai / baselines
View on GitHub
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
☆16,651Aug 1, 2024Updated last year
mrkulk / hierarchical-deep-RL
View on GitHub
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation
☆87Mar 5, 2018Updated 7 years ago
muupan / async-rl
View on GitHub
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
☆408Feb 25, 2017Updated 9 years ago
google-deepmind / trfl
View on GitHub
TensorFlow Reinforcement Learning
☆3,135Dec 8, 2022Updated 3 years ago
jjakimoto / PPO-Pytorch
View on GitHub
Deep RL for portfolio management
☆13Aug 31, 2018Updated 7 years ago
divyahansg / RecurrentDPG
View on GitHub
CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)
☆10Jun 10, 2017Updated 8 years ago
zygmuntz / kaggle-rossmann
View on GitHub
Predicting sales with Pandas
☆15Nov 4, 2015Updated 10 years ago
lshengjian / ep_sim
View on GitHub
Electroplating simulation environment
☆20Sep 26, 2024Updated last year
haarnoja / sac
View on GitHub
Soft Actor-Critic
☆1,220Nov 29, 2023Updated 2 years ago
khanrc / mnist
View on GitHub
☆13May 4, 2017Updated 8 years ago
ppyht2 / tf-a2c
View on GitHub
Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games
☆12Feb 15, 2018Updated 8 years ago
antonio-f / Dynamic-Programming
View on GitHub
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…
☆11Apr 3, 2019Updated 6 years ago
osigaud / Basic-Policy-Gradient-Labs
View on GitHub
A repo to design basic Policy Gradient labs
☆12Jul 6, 2023Updated 2 years ago
dion-jy / gym-td3-keras
View on GitHub
Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework
☆11May 29, 2021Updated 4 years ago
Kyushik / DRL
View on GitHub
Repository for codes of 'Deep Reinforcement Learning'
☆218Oct 4, 2019Updated 6 years ago
angusfung / population-based-training
View on GitHub
Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.
☆55Sep 21, 2018Updated 7 years ago
jiamings / ais
View on GitHub
Annealed Importance Sampling (AIS) for generative models.
☆16Jul 20, 2018Updated 7 years ago
shinseung428 / image_control_TF
View on GitHub
☆13Feb 17, 2018Updated 8 years ago
brendanator / atari-rl
View on GitHub
Atari - Deep Reinforcement Learning algorithms in TensorFlow
☆139Mar 27, 2024Updated last year
mkolod / scala-data-science
View on GitHub
Data Science in Scala - Conf. Talk Repo
☆15Mar 22, 2016Updated 9 years ago
mansimov / acktr
View on GitHub
☆17Sep 15, 2017Updated 8 years ago
liampetti / DDPG
View on GitHub
Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…
☆64Apr 27, 2017Updated 8 years ago