openai / sonic-on-ray
Training Sonic with RLlib
☆58 Updated last year
Alternatives and similar repositories for sonic-on-ray:
Users who are interested in sonic-on-ray are comparing it to the libraries listed below.
- OpenAI Retro Contest ☆65 Updated last year
- Publicly releasable baselines for the Retro contest ☆127 Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog ☆31 Updated 6 years ago
- ☆117 Updated 4 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration" ☆197 Updated 6 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge ☆84 Updated 5 years ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- ☆56 Updated 2 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards ☆72 Updated last year
- Simple tools for statistical analyses in RL experiments ☆66 Updated 6 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆50 Updated 4 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning ☆20 Updated 7 years ago
- A parallel version of Trust Region Policy Optimization ☆65 Updated 7 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017. ☆34 Updated 7 years ago
- Models built with TensorFlow ☆25 Updated 6 years ago
- Our NIPS 2017: Learning to Run source code ☆55 Updated last year
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆80 Updated 7 years ago
- ☆29 Updated 6 years ago
- A TensorFlow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆32 Updated 7 years ago
- A grid-world platform that supports up to 1 million reinforcement-learning agents. ☆69 Updated 7 years ago
- A reinforcement learning framework ☆154 Updated 6 years ago
- World Models applied to the OpenAI Sonic Retro Contest ☆77 Updated 6 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DQN) ☆56 Updated 7 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration" ☆32 Updated 6 years ago
- ☆44 Updated 6 years ago
- A working implementation of the Categorical DQN (Distributional RL). ☆96 Updated 6 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u… ☆51 Updated 5 years ago
- Wikipedia navigation environment for OpenAI Gym ☆40 Updated last year
- Deep RL Bootcamp solutions ☆35 Updated 7 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments ☆35 Updated 6 years ago