Shmuma / rl
RL experiments
☆69 · Updated last year
Related projects:
- A reinforcement learning framework ☆154 · Updated 5 years ago
- ☆117 · Updated 4 years ago
- Combining deep learning and reinforcement learning. ☆81 · Updated 2 years ago
- C51-DDQN in Keras ☆125 · Updated 6 years ago
- This package allows PLE to be used as a gym environment. ☆73 · Updated 4 years ago
- A working implementation of the Categorical DQN (Distributional RL). ☆96 · Updated 6 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge ☆84 · Updated 4 years ago
- ☆22 · Updated 5 years ago
- Reinforcement Learning in Keras on VizDoom ☆146 · Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog ☆30 · Updated 6 years ago
- Bandits Environments for the OpenAI Gym ☆88 · Updated 4 years ago
- Our team's code for the OpenAI Retro Contest on reinforcement learning ☆100 · Updated 6 years ago
- Highly Modular and Scalable Reinforcement Learning ☆113 · Updated 4 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch ☆197 · Updated 3 years ago
- ☆124 · Updated 6 years ago
- Velocity in deep-learning research ☆276 · Updated last year
- Easy TensorFlow logging for quick prototypes ☆110 · Updated 2 years ago
- World Models applied to the OpenAI Sonic Retro Contest ☆77 · Updated 6 years ago
- Publicly releasable baselines for the Retro contest ☆128 · Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆140 · Updated last year
- Direct Future Prediction (DFP) in Keras ☆109 · Updated 6 years ago
- Some common TD learning algorithms ☆67 · Updated 4 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u… ☆51 · Updated 4 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th… ☆201 · Updated 4 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks. ☆56 · Updated 5 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C) ☆39 · Updated 2 years ago
- Deep Reinforcement Learning library for humans ☆300 · Updated 7 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms. ☆373 · Updated last year
- A simple stochastic OpenAI environment for training RL agents ☆89 · Updated last year
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow ☆101 · Updated 4 years ago