yanpanlau / CartPole
Various DQN methods applied to CartPole
☆11 Updated 6 years ago
Alternatives and similar repositories for CartPole:
Users who are interested in CartPole are comparing it to the repositories listed below:
- Tiny implementation of Deep Q-Network with TensorFlow ☆11 Updated 7 years ago
- This repository contains a Python implementation of a Deep Q-Network (DQN) for Atari gameplay using TensorFlow. ☆6 Updated 6 years ago
- Plan, Attend, Generate: Planning for Sequence-to-Sequence Models ☆10 Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 Updated 6 years ago
- PyTorch implementation of Human-Level Control through Deep Reinforcement Learning ☆11 Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks. ☆69 Updated 7 years ago
- Deep Q-Network implemented in TensorFlow ☆25 Updated 7 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch ☆42 Updated 6 years ago
- Distributed TensorFlow implementation of Asynchronous Methods for Deep Reinforcement Learning ☆29 Updated 7 years ago
- Professor Forcing, NIPS'16 ☆45 Updated 8 years ago
- Implementation of Scheduled Policy Optimization for task-oriented language grounding ☆29 Updated 6 years ago
- Rainbow, TensorFlow ☆49 Updated 7 years ago
- Contextual LSTM for NLP tasks like word prediction and word embedding creation for Deep Learning ☆28 Updated 5 years ago
- Bi-Directional Attention Flow for Machine Comprehension ☆9 Updated 7 years ago
- Keras implementation of MinimalRNN: Toward More Interpretable and Trainable Recurrent Neural Networks ☆17 Updated 7 years ago
- *SEM 2018: Learning Distributed Event Representations with a Multi-Task Approach ☆21 Updated 6 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialogue for under… ☆10 Updated 2 years ago
- The solution to https://openai.com/requests-for-research ☆12 Updated 8 years ago
- ☆26 Updated 7 years ago
- Deep Transfer Reinforcement Learning for Text Summarization ☆42 Updated 6 years ago
- GAN and Seq2Seq ☆25 Updated 6 years ago
- Value iteration, policy iteration, and Q-Learning in a grid-world MDP. ☆28 Updated last year
- This is a tutorial written for Caffe2 which mocks Google AlphaGo Fan and AlphaGo Zero. ☆8 Updated 6 years ago
- ☆16 Updated 8 years ago
- TensorFlow implementation for "Noisy network for exploration" ☆32 Updated 7 years ago
- Chinese Natural Language Correction via Language Model ☆14 Updated 7 years ago
- Code for Category-aware Generative Adversarial Networks (AAAI 2020) ☆18 Updated 4 years ago
- A reproduction of AlphaGo Zero in "Mastering the game of Go without human knowledge" ☆13 Updated 7 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization ☆44 Updated 6 years ago
- This is an implementation of Value Iteration Networks (NIPS 2016 best paper) in Keras ☆17 Updated 7 years ago