A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆182Feb 10, 2019Updated 7 years ago
Alternatives and similar repositories for A2C
Users that are interested in A2C are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Actor Critic using Kronecker-Factored Trust Region☆19Jul 3, 2018Updated 7 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆52Feb 4, 2020Updated 6 years ago
- Trust Region Policy Optimization (TRPO) in pure TensorFlow☆18Jun 7, 2018Updated 7 years ago
- Probabilistic line search algorithm for stochastic optimization with a TensorFlow interface.☆21Jul 27, 2017Updated 8 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Aug 9, 2018Updated 7 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,882May 29, 2022Updated 3 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37May 9, 2019Updated 6 years ago
- ☆18Mar 19, 2019Updated 7 years ago
- ☆120Jul 9, 2020Updated 5 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms☆16,677Aug 1, 2024Updated last year
- Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch☆41Mar 23, 2018Updated 8 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Jan 5, 2023Updated 3 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Feb 25, 2017Updated 9 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,317Sep 25, 2019Updated 6 years ago
- TensorFlow Reinforcement Learning☆3,135Dec 8, 2022Updated 3 years ago
- Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆100Feb 1, 2020Updated 6 years ago
- ☆12Dec 7, 2017Updated 8 years ago
- in this repo, I create models to process image (upscale, debluring...)☆15Oct 5, 2021Updated 4 years ago
- Actor-critic with experience replay☆258Oct 9, 2022Updated 3 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆88Mar 5, 2018Updated 8 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆154May 28, 2023Updated 2 years ago
- Collection of Deep Reinforcement Learning algorithms☆300Mar 19, 2019Updated 7 years ago
- Soft Actor-Critic☆1,230Nov 29, 2023Updated 2 years ago
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- Deepmind Recurrent Environment Simulators paper implementation in tensorflow☆74Feb 2, 2018Updated 8 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆55Sep 21, 2018Updated 7 years ago
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆693Dec 18, 2025Updated 3 months ago
- A set of Deep Reinforcement Learning Agents implemented in Tensorflow.☆2,276Feb 12, 2019Updated 7 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- TensorFlow input pipelines for multiple datasets for easy data fetching☆54Dec 19, 2016Updated 9 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆1,021Mar 13, 2019Updated 7 years ago
- ☆29Jun 23, 2018Updated 7 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆83Jul 19, 2018Updated 7 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Implementation of Attentive Multi Task Deep Reinforcement Learning Architecture in Tensorflow☆15Apr 5, 2019Updated 6 years ago
- 本项目致力于多人合作实现强化学习用于交通信号灯控制领域,代码将同步更新☆12Mar 11, 2019Updated 7 years ago