aborghi / retro_contest_agent
☆29Updated 6 years ago
Alternatives and similar repositories for retro_contest_agent:
Users that are interested in retro_contest_agent are comparing it to the libraries listed below
- ☆46Updated 6 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆99Updated 6 years ago
- ☆43Updated 5 years ago
- Reinforcement learning in 3D.☆21Updated 8 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Updated 7 years ago
- ☆117Updated 4 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Updated 5 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 7 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
- Web-based Reinforcement Learning Control Center☆64Updated 8 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆42Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Updated 6 years ago
- PyTorch implementation of Memory Augmented Self-Play☆50Updated 4 years ago
- OpenAI Retro Contest☆65Updated 2 years ago
- Collection of tutorials, exercises and papers on RL☆17Updated 7 years ago
- A reinforcement learning framework☆155Updated 6 years ago
- Implementation of modular composition network from https://arxiv.org/pdf/1711.11289.pdf☆25Updated 7 years ago
- Keras implementation of DQN on ViZDoom environment☆54Updated 8 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Updated 7 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆53Updated 5 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- ☆22Updated 6 years ago
- Publicly releasable baselines for the Retro contest☆127Updated 6 years ago
- Models built with TensorFlow☆25Updated 6 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- ☆30Updated 8 years ago
- Training Sonic with RLlib☆59Updated 2 years ago
- Add-on for OpenAI Gym that supports automatic downloading of user environments.☆45Updated 7 years ago