A continuous action space version of A3C LSTM in pytorch plus A3G design
☆260Oct 11, 2024Updated last year
Alternatives and similar repositories for a3c_continuous
Users that are interested in a3c_continuous are comparing it to the libraries listed below
Sorting:
- A3C LSTM Atari with Pytorch plus A3G design☆570Apr 18, 2023Updated 2 years ago
- A high-performance Atari A3C agent in 180 lines of PyTorch☆173Jul 31, 2021Updated 4 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,316Sep 25, 2019Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆106Jun 7, 2019Updated 6 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆96Jul 27, 2022Updated 3 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Jul 4, 2018Updated 7 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 5 years ago
- Simple A3C implementation with pytorch + multiprocessing☆657Mar 10, 2023Updated 2 years ago
- ☆36Aug 10, 2018Updated 7 years ago
- Evolution Strategies Tool☆958Dec 8, 2022Updated 3 years ago
- Highly Modular and Scalable Reinforcement Learning☆118Jan 14, 2020Updated 6 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Jun 2, 2020Updated 5 years ago
- ☆69Nov 30, 2018Updated 7 years ago
- ppo-lstm-parallel☆49Mar 26, 2019Updated 6 years ago
- ☆54Feb 19, 2018Updated 8 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆63Jul 30, 2018Updated 7 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Jun 18, 2019Updated 6 years ago
- Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)☆238Apr 16, 2018Updated 7 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875May 29, 2022Updated 3 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- Implementation of TRPO and related algorithms☆647May 20, 2018Updated 7 years ago
- Tensorflow implementation of proximal policy optimization (PPO) algorithm☆13Feb 28, 2018Updated 8 years ago
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆692Dec 18, 2025Updated 2 months ago
- ☆18Feb 7, 2021Updated 5 years ago
- [MIG2021] Deep Reinforcement Learning with Particle Filtering Policy Network for Physics-Based Character Control☆17Feb 25, 2022Updated 4 years ago
- Deep Reinforcement Learning with pytorch & visdom☆804Jul 16, 2020Updated 5 years ago
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760☆24May 3, 2019Updated 6 years ago
- Gated Path Planning Networks (ICML 2018)☆180Jan 23, 2019Updated 7 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆195Sep 19, 2024Updated last year
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆773Dec 22, 2023Updated 2 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Aug 9, 2018Updated 7 years ago
- PERCH 2.0 : Fast and Accurate GPU-based Perception via Search for Object Pose Estimation☆16Aug 22, 2020Updated 5 years ago
- Rainbow: Combining Improvements in Deep Reinforcement Learning☆1,660Jan 13, 2022Updated 4 years ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,272Feb 9, 2021Updated 5 years ago
- A PyTorch Library for Reinforcement Learning Research☆198Jun 22, 2025Updated 8 months ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Dec 8, 2022Updated 3 years ago
- Implementation of Meta-RL A3C algorithm☆407Feb 22, 2017Updated 9 years ago