cilvrRG / RL
Reading Group on Reinforcement Learning topics
☆55 · Updated 8 years ago
Alternatives and similar repositories for RL:
Users who are interested in RL are comparing it to the libraries listed below
- Topics on theoretical and mathematical aspects of DL ☆71 · Updated 8 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN" ☆42 · Updated 8 years ago
- Train an RL agent to play multiple Atari games at once ☆70 · Updated 8 years ago
- ☆17 · Updated 7 years ago
- Universal library for deep reinforcement learning. ☆38 · Updated 8 years ago
- ☆56 · Updated 6 years ago
- Deterministic Policy Gradient using torch7 ☆43 · Updated 8 years ago
- Asynchronous Advantage Actor Critic ☆20 · Updated 8 years ago
- Reinforcement learning environments for Torch7 ☆92 · Updated 8 years ago
- ☆38 · Updated 7 years ago
- Learning RNN Hierarchies ☆45 · Updated 8 years ago
- Torch implementation of the Deep Network for Global Optimization (DNGO) ☆51 · Updated 8 years ago
- Torch7 implementation of "Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images" ☆42 · Updated 9 years ago
- Translating neuralese ☆44 · Updated 7 years ago
- ☆11 · Updated 8 years ago
- ☆97 · Updated 8 years ago
- Reinforcement learning, policy gradient, PCL ☆37 · Updated 7 years ago
- Simple PuddleWorld DQN example using torch7 ☆29 · Updated 8 years ago
- Cluttered MNIST Dataset ☆50 · Updated 9 years ago
- This is my implementation of Optimality Tightening ☆37 · Updated 7 years ago
- Malmo Collaborative AI Challenge - Team Pig Catcher ☆65 · Updated 7 years ago
- Playground for reinforcement learning algorithms implemented in TensorFlow ☆16 · Updated 8 years ago
- Implementation of the paper "Model-Free Episodic Control" ☆36 · Updated 5 years ago
- Implementation of a simple example of Q learning in Torch. ☆50 · Updated 7 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem… ☆17 · Updated 7 years ago
- Reasonably-okay-performing implementation of a GAN and an adversarial autoencoder on MNIST. ☆29 · Updated 9 years ago
- Implementations of differentiable stacks, queues, and deques from "Learning to Transduce with Unbounded Memory" ☆20 · Updated 9 years ago
- Learning to Discover Efficient Mathematical Identities ☆48 · Updated 10 years ago
- WebNav: A New Large-Scale Task for Natural Language based Sequential Decision Making ☆82 · Updated 7 years ago
- Implementation of Policy Gradient algorithms in PyTorch. (Sequential, Distributed sync + async) ☆9 · Updated 7 years ago