xwhan / walk_the_blocks
Implementation of Scheduled Policy Optimization for task-oriented language grouding
☆29Updated 6 years ago
Alternatives and similar repositories for walk_the_blocks:
Users that are interested in walk_the_blocks are comparing it to the libraries listed below
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆43Updated 8 years ago
- Sorting numbers with pointer networks☆55Updated 6 years ago
- Pointer Networks☆103Updated 9 years ago
- Deep generative model for sentiment analysis☆34Updated 8 years ago
- TensorFlow implementation of Pointer Networks, modified to use a threshold (or hardmax) pointer instead of a softmax pointer.☆40Updated 7 years ago
- Neural-based Noise Filtering from Word Embeddings☆11Updated 7 years ago
- Pytorch implementation of Human-Level Control through Deep Reinforcement Learning☆11Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- Reranking-based dependency parsing with inside-outside recursive neural network☆20Updated 10 years ago
- An attempt at applying Deep RL on the board game 2048☆16Updated 8 years ago
- ☆53Updated 8 years ago
- This repo is for residual-connected sentence encoder for NLI.☆11Updated 7 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26Updated 7 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 7 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- This project contains several Deep Reinforcement Learning method and some experiments basd on OpenAi gym.☆20Updated 7 years ago
- ICML 2017 accepted papers on arXiv.org☆17Updated 7 years ago
- Attention models☆33Updated 9 years ago
- Atari gauntlet for RL agents☆29Updated 8 years ago
- Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control)☆24Updated 7 years ago
- Recurrent neural networks and Dynamic memory networks for sentiment classification☆30Updated 7 years ago
- code for our IJCAI 2018 paper : "Lifelong Domain Word Embedding via Meta-Learning"☆11Updated 5 years ago
- in progress☆107Updated 7 years ago
- An optimized version of SeqGAN in pytorch☆13Updated 6 years ago
- Experimentation with Highway Networks & GradNets☆13Updated 9 years ago
- Dynamic Entity Representation (Kobayashi et al., 2016)☆20Updated 8 years ago
- DDPG on OpenAI Gym Pendulum☆18Updated 8 years ago
- ☆12Updated 6 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago