xwhan / walk_the_blocksLinks
Implementation of Scheduled Policy Optimization for task-oriented language grouding
☆29Updated 7 years ago
Alternatives and similar repositories for walk_the_blocks
Users that are interested in walk_the_blocks are comparing it to the libraries listed below
Sorting:
- Collaborative Deep Reinforcement Learning☆31Updated 8 years ago
 - Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆43Updated 9 years ago
 - Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 8 years ago
 - in progress☆108Updated 8 years ago
 - Pointer Networks☆103Updated 9 years ago
 - A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Updated 8 years ago
 - Pointer Networks Implementation in Keras☆155Updated 3 years ago
 - Atari gauntlet for RL agents☆29Updated 8 years ago
 - ☆53Updated 8 years ago
 - Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 9 years ago
 - DDPG on OpenAI Gym Pendulum☆18Updated 9 years ago
 - Sorting numbers with pointer networks☆55Updated 7 years ago
 - An attempt at applying Deep RL on the board game 2048☆17Updated 8 years ago
 - Deep generative model for sentiment analysis☆34Updated 8 years ago
 - Attention is All You Need in Sonnet☆38Updated 8 years ago
 - ☆56Updated 7 years ago
 - reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
 - TensorFlow implementation of Pointer Networks☆203Updated 8 years ago
 - Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
 - A platform of grid world that supports up to 1 million reinforcement-learning agents.☆69Updated 8 years ago
 - reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
 - Policy gradient reinforcement learning algorithm with importance sampling☆32Updated 8 years ago
 - Neural-based Noise Filtering from Word Embeddings☆11Updated 8 years ago
 - Task-based end-to-end model learning in stochastic optimization☆212Updated 4 years ago
 - Variational Autoencoders (VAEs) in Theano for Images and Text☆54Updated 7 years ago
 - Attention models☆32Updated 9 years ago
 - TensorFlow implementation of Pointer Networks, modified to use a threshold (or hardmax) pointer instead of a softmax pointer.☆40Updated 8 years ago
 - reimplementation of the ddpg algorithm using tensorflow☆38Updated 9 years ago
 - PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 8 years ago
 - Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 8 years ago