hope-yao / cartpoleLinks
Tensorflow implementation of DQN to control cart-pole from OpenAI gym environment
☆14Updated 7 years ago
Alternatives and similar repositories for cartpole
Users that are interested in cartpole are comparing it to the libraries listed below
Sorting:
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 2 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- Multi-Objective Reinforcement Learning components built on top of RL glue components☆29Updated 2 years ago
- POMDP wrappers for OpenAI Gym☆15Updated 5 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆49Updated 4 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 5 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 8 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆21Updated 7 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆60Updated 6 years ago
- Safe Reinforcement Learning algorithms☆74Updated 2 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 2 months ago
- PyTorch implementation of various reinforcement learning algorithms☆18Updated 7 years ago
- Tensorflow implementation of the asynchronous advantage actor-critic (a3c) reinforcement learning algorithm for continuous action space☆46Updated 7 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆138Updated 2 years ago
- A reusable framework for successor features for transfer in deep reinforcement learning using keras.☆44Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆22Updated 4 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆52Updated 4 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 4 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- ppo-lstm-parallel☆44Updated 6 years ago
- Applying minimaxQ learning algorithm to 2 agents games☆33Updated 7 years ago
- Soft Actor-Critic with advanced features☆50Updated this week
- Interfacing RL agents with user-definable neural networks and OpenAI-gym environments.☆12Updated 6 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆38Updated 3 years ago