motokiomura / apex_dqn
☆18Updated 5 months ago
Alternatives and similar repositories for apex_dqn:
Users that are interested in apex_dqn are comparing it to the libraries listed below
- Pytorch implementation of distributed deep reinforcement learning☆75Updated 2 years ago
- Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)☆72Updated 2 months ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆133Updated 7 years ago
- PyTorch Implementation of Distributed Prioritized Experience Replay(Ape-X)☆153Updated 5 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆44Updated last year
- PyTorch implementation of FQF, IQN and QR-DQN.☆169Updated 7 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆148Updated 3 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action☆113Updated 6 years ago
- ☆28Updated 5 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆103Updated 4 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- Keeping track of RL experiments☆162Updated 2 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆30Updated 4 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆56Updated 4 years ago
- FQF(Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for At…☆42Updated 4 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Updated 5 years ago
- Deep reinforcement learning with tensorflow2☆93Updated last week
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- ☆193Updated last year
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆136Updated 6 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Updated 6 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆27Updated 4 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆189Updated 5 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆89Updated 7 months ago