hope-yao / cartpole
Tensorflow implementation of DQN to control cart-pole from OpenAI gym environment
☆14Updated 7 years ago
Alternatives and similar repositories for cartpole:
Users that are interested in cartpole are comparing it to the libraries listed below
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆60Updated 5 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 4 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆49Updated 4 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 5 years ago
- A reusable framework for successor features for transfer in deep reinforcement learning using keras.☆43Updated 3 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 6 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆31Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- Safe Reinforcement Learning algorithms☆74Updated 2 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆139Updated 6 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated last month
- PyTorch implementation of CommNet☆36Updated 7 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆99Updated 3 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Distributed Deep Reinforcement Learning☆29Updated 4 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 4 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆33Updated 5 years ago
- ☆75Updated 10 months ago
- Soft Actor-Critic with advanced features☆49Updated this week
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 5 years ago
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago
- ☆28Updated 6 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago