xwhan / walk_the_blocks
Implementation of Scheduled Policy Optimization for task-oriented language grouding
☆29Updated 6 years ago
Related projects: ⓘ
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆41Updated 7 years ago
- An attempt at applying Deep RL on the board game 2048☆16Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 6 years ago
- Pointer Networks☆100Updated 8 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 7 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 6 years ago
- hierarchical Q-learning implementation☆11Updated 9 years ago
- Neural-based Noise Filtering from Word Embeddings☆11Updated 7 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 6 years ago
- ☆53Updated 7 years ago
- DDPG on OpenAI Gym Pendulum☆19Updated 8 years ago
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆54Updated 5 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆27Updated 4 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 5 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆32Updated 8 years ago
- Ranking Policy Gradient☆23Updated 4 years ago
- ICML 2017 accepted papers on arXiv.org☆17Updated 7 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 6 years ago
- Attention models☆34Updated 8 years ago
- Atari gauntlet for RL agents☆29Updated 7 years ago
- Deep generative model for sentiment analysis☆34Updated 7 years ago
- (Theano) Implementations about deep neural network, recurrent neural network, LSTM, and structured learining.☆10Updated 7 years ago
- ☆10Updated 4 years ago
- Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control)☆24Updated 6 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆27Updated 7 years ago
- Reranking-based dependency parsing with inside-outside recursive neural network☆20Updated 9 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago