hope-yao / cartpoleLinks
Tensorflow implementation of DQN to control cart-pole from OpenAI gym environment
☆14Updated 7 years ago
Alternatives and similar repositories for cartpole
Users that are interested in cartpole are comparing it to the libraries listed below
Sorting:
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆24Updated last year
- A PyTorch implementation of deep Q-learning for Atari games☆14Updated 6 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11Updated 5 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆78Updated 4 years ago
- Safe Reinforcement Learning algorithms☆74Updated 3 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 3 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆27Updated 4 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 5 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Updated 2 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Multi-Objective Reinforcement Learning components built on top of RL glue components☆29Updated 2 years ago
- This is pytorch implmentation project of Bootsrapped DQN☆12Updated 4 years ago
- Soft Actor-Critic with advanced features☆51Updated last week
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 8 years ago
- PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)☆33Updated 3 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆62Updated 6 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆15Updated 7 years ago
- Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow☆17Updated 6 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆146Updated 6 years ago
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆17Updated 6 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆104Updated 4 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆119Updated 9 months ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆16Updated 5 years ago
- SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis☆29Updated 6 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆23Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Updated 6 years ago