CUN-bjy / gym-ddpg-keras
Keras implementation of DDPG (Deep Deterministic Policy Gradient) with a PER (Prioritized Experience Replay) option on the OpenAI Gym framework
☆13 Updated 2 years ago
Alternatives and similar repositories for gym-ddpg-keras
Users who are interested in gym-ddpg-keras are comparing it to the repositories listed below.
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning ☆23 Updated 6 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021 ☆13 Updated 3 years ago
- ☆10 Updated 2 years ago
- ☆19 Updated last year
- Official code for the paper: Invertible Neural Network for Graph Prediction ☆10 Updated 2 years ago
- TaskMet: Task-driven Metric Learning for Model Learning ☆19 Updated last year
- Implementation of Proximal Policy Optimization in JAX+Flax ☆20 Updated 2 years ago
- Repo for ICML'23 paper SurCo: Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems ☆18 Updated 2 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655 ☆14 Updated 4 years ago
- Public code for implementation and experiments with differentiable decision trees. ☆29 Updated last year
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆45 Updated 5 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R… ☆24 Updated 4 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices ☆23 Updated 4 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2. ☆14 Updated 4 years ago
- ☆14 Updated 2 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354 ☆27 Updated 4 years ago
- Contextual Bandits Action Elimination DQN ☆21 Updated 7 years ago
- ☆11 Updated 4 years ago
- PyTorch implementation of the Deep Deterministic Policy Gradients for Continuous Control ☆26 Updated 2 years ago
- Understanding RL vision Distill article ☆24 Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 4 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 5 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e… ☆53 Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020 ☆32 Updated 4 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym. ☆12 Updated 3 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆105 Updated 4 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆38 Updated 7 months ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆44 Updated 2 years ago
- Reinforcement Learning Methods with PyTorch ☆38 Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning ☆34 Updated 5 years ago