hope-yao / cartpole
TensorFlow implementation of DQN to control the cart-pole from the OpenAI Gym environment
☆14 Updated 7 years ago
Alternatives and similar repositories for cartpole
Users who are interested in cartpole are comparing it to the repositories listed below
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆27 Updated 5 years ago
- ☆28 Updated 7 years ago
- PyTorch implementation of CommNet ☆36 Updated 7 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆43 Updated 2 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning. ☆60 Updated 6 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning ☆23 Updated 6 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆71 Updated 8 years ago
- Safe Reinforcement Learning algorithms ☆74 Updated 2 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆28 Updated 6 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆95 Updated 2 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874 ☆47 Updated 4 years ago
- ☆76 Updated last year
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆67 Updated 5 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 Updated 6 years ago
- Simple grid-world environment compatible with OpenAI Gym ☆50 Updated 5 years ago
- A repository for code of reinforcement learning algorithms with PyTorch ☆30 Updated 3 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement… ☆20 Updated 2 years ago
- Hierarchical deep reinforcement learning algorithms ☆41 Updated 7 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆50 Updated last month
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems ☆31 Updated 6 years ago
- POMDP wrappers for OpenAI Gym ☆15 Updated 5 years ago
- A simple Gridworld environment for OpenAI Gym ☆25 Updated 7 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆50 Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis ☆29 Updated 6 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆55 Updated 2 years ago
- ☆11 Updated 5 years ago
- Research and implementations of Deep RL agents and their applications ☆51 Updated 3 weeks ago
- TensorFlow implementation of the asynchronous advantage actor-critic (A3C) reinforcement learning algorithm for continuous action space ☆46 Updated 7 years ago
- TD-Regularized Actor-Critic Methods ☆36 Updated 5 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020 ☆32 Updated 3 years ago