hsjharvey / Reinforcement-Learning
Reinforcement learning algorithm implementation
☆10 · Updated 4 years ago
Alternatives and similar repositories for Reinforcement-Learning
Users who are interested in Reinforcement-Learning are comparing it to the libraries listed below.
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Updated 6 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51 ☆22 · Updated 6 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN ☆11 · Updated 5 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns. ☆24 · Updated last year
- ☆16 · Updated 4 years ago
- Deep Reinforcement Learning for Nash Equilibria ☆47 · Updated 3 years ago
- Evolution-based Soft Actor-Critic (ESAC) ☆42 · Updated last year
- PyRL - Reinforcement Learning Framework in PyTorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.) ☆34 · Updated 3 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF… ☆33 · Updated 5 years ago
- PyTorch implementation of the Ape-X (distributed prioritized experience replay) architecture with a DQN learner ☆28 · Updated 5 years ago
- Prioritized Sequence Experience Replay ☆10 · Updated 4 years ago
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions ☆31 · Updated 4 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning) ☆12 · Updated 2 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method. ☆11 · Updated 6 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. The implementation also includes the extensions Munchausen RL and D2R… ☆24 · Updated 4 years ago
- Soft Actor-Critic with advanced features ☆50 · Updated this week
- ☆12 · Updated 4 years ago
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆51 · Updated 4 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning. ☆62 · Updated 6 years ago
- Efficient Exploration through Bayesian Deep Q-Networks ☆37 · Updated 7 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆56 · Updated 9 months ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis ☆30 · Updated 7 years ago
- Reinforcement Learning with Perturbed Reward, AAAI 2020 ☆30 · Updated last year
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 · Updated 4 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1, Acrobot-v1, MountainCar-v0 ☆14 · Updated 7 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control" ☆22 · Updated 6 years ago
- Exploration by Random Network Distillation ☆15 · Updated 7 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning" ☆12 · Updated 3 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...) ☆16 · Updated 5 years ago
- Experiments on a discrete mean field game model of population dynamics with reinforcement learning ☆38 · Updated 2 years ago