maximecb / baby-ai-game
☆36Updated this week
Related projects: ⓘ
- ☆44Updated 5 years ago
- ☆70Updated this week
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 4 years ago
- Modular multitask reinforcement learning with policy sketches☆105Updated 3 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆70Updated 7 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆77Updated 11 months ago
- NIPS 2017 Value Prediction Network☆165Updated 6 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 5 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- ☆130Updated this week
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- On the pitfalls of measuring emergent communication☆33Updated 5 years ago
- Train an RL agent to play multiple Atari games at once☆71Updated 8 years ago
- Some hard problems for reinforcement learning.☆32Updated 5 years ago
- ☆42Updated 7 years ago
- Model-Based Generative Adversarial Imitation Learning☆88Updated 3 years ago
- An official TensorFlow implementation of "Neural Program Synthesis from Diverse Demonstration Videos" (ICML 2018) by Shao-Hua Sun, Hyeonw…☆102Updated last year
- Karel dataset for program synthesis and program induction☆78Updated 6 years ago
- some common TD Learning algorithms☆67Updated 4 years ago
- Neural Programmer-Interpreter Implementation (Reed, de Freitas: https://arxiv.org/abs/1511.06279), in Tensorflow☆41Updated 5 years ago
- RL framework for embodied agents based on PyTorch☆12Updated 5 years ago
- ☆98Updated 8 years ago
- General Game Playing with Schema Networks☆41Updated 2 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆56Updated 8 years ago
- ML/DL/RL paper notes☆21Updated 5 years ago
- Code for 'The Grand Atari Challenge dataset' paper☆52Updated 6 years ago