A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆182Feb 10, 2019Updated 7 years ago
Alternatives and similar repositories for A2C
Users that are interested in A2C are comparing it to the libraries listed below
Sorting:
- Trust Region Policy Optimization (TRPO) in pure TensorFlow☆18Jun 7, 2018Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Jul 3, 2018Updated 7 years ago
- Probabilistic line search algorithm for stochastic optimization with a TensorFlow interface.☆22Jul 27, 2017Updated 8 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆52Feb 4, 2020Updated 6 years ago
- TensorFlow input pipelines for multiple datasets for easy data fetching☆54Dec 19, 2016Updated 9 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875May 29, 2022Updated 3 years ago
- Deepmind Recurrent Environment Simulators paper implementation in tensorflow☆74Feb 2, 2018Updated 8 years ago
- ☆119Jul 9, 2020Updated 5 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Aug 9, 2018Updated 7 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,316Sep 25, 2019Updated 6 years ago
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 7 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37May 9, 2019Updated 6 years ago
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆692Dec 18, 2025Updated 2 months ago
- A multi-agent environment using Unity ML-Agents Toolkit☆10Dec 9, 2020Updated 5 years ago
- ☆12Dec 7, 2017Updated 8 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms☆16,651Aug 1, 2024Updated last year
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆87Mar 5, 2018Updated 7 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Feb 25, 2017Updated 9 years ago
- TensorFlow Reinforcement Learning☆3,135Dec 8, 2022Updated 3 years ago
- Deep RL for portfolio management☆13Aug 31, 2018Updated 7 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Predicting sales with Pandas☆15Nov 4, 2015Updated 10 years ago
- Electroplating simulation environment☆20Sep 26, 2024Updated last year
- Soft Actor-Critic☆1,220Nov 29, 2023Updated 2 years ago
- ☆13May 4, 2017Updated 8 years ago
- Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games☆12Feb 15, 2018Updated 8 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 6 years ago
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 4 years ago
- Repository for codes of 'Deep Reinforcement Learning'☆218Oct 4, 2019Updated 6 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆55Sep 21, 2018Updated 7 years ago
- Annealed Importance Sampling (AIS) for generative models.☆16Jul 20, 2018Updated 7 years ago
- ☆13Feb 17, 2018Updated 8 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆139Mar 27, 2024Updated last year
- Data Science in Scala - Conf. Talk Repo☆15Mar 22, 2016Updated 9 years ago
- ☆17Sep 15, 2017Updated 8 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Apr 27, 2017Updated 8 years ago