maximecb / baby-ai-game

☆36

Related projects: ⓘ

Feryal / craft-env
☆44Updated 5 years ago
mjacar / pytorch-nec
☆70Updated this week
DanielTakeshi / rl_algorithms
I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…
☆51Updated 4 years ago
jacobandreas / psketch
Modular multitask reinforcement learning with policy sketches
☆105Updated 3 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96Updated 6 years ago
geek-ai / 1m-agents
A platform of grid world that supports up to 1 million reinforcement-learning agents.
☆70Updated 7 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆77Updated 11 months ago
junhyukoh / value-prediction-network
NIPS 2017 Value Prediction Network
☆165Updated 6 years ago
kimhc6028 / pytorch-noreward-rl
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆79Updated 5 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49Updated 6 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆90Updated 6 years ago
junhyukoh / icml2016-minecraft
Implementation of "Control of Memory, Active Perception, and Action in Minecraft"
☆86Updated 7 years ago
facebookresearch / reward-estimator-corl
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆21Updated 5 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Updated 5 years ago
mjacar / pytorch-trpo
☆130Updated this week
Breakend / OptionGAN
Code accompanying the OptionGAN paper.
☆43Updated 6 years ago
facebookresearch / measuring-emergent-comm
On the pitfalls of measuring emergent communication
☆33Updated 5 years ago
eparisotto / ActorMimic
Train an RL agent to play multiple Atari games at once
☆71Updated 8 years ago
JohnLangford / RL_acid
Some hard problems for reinforcement learning.
☆32Updated 5 years ago
mcmachado / options
☆42Updated 7 years ago
itaicaspi / mgail
Model-Based Generative Adversarial Imitation Learning
☆88Updated 3 years ago
shaohua0116 / demo2program
An official TensorFlow implementation of "Neural Program Synthesis from Diverse Demonstration Videos" (ICML 2018) by Shao-Hua Sun, Hyeonw…
☆102Updated last year
carpedm20 / karel
Karel dataset for program synthesis and program induction
☆78Updated 6 years ago
chrodan / tdlearn
some common TD Learning algorithms
☆67Updated 4 years ago
siddk / npi
Neural Programmer-Interpreter Implementation (Reed, de Freitas: https://arxiv.org/abs/1511.06279), in Tensorflow
☆41Updated 5 years ago
idlrl / flare
RL framework for embodied agents based on PyTorch
☆12Updated 5 years ago
Ardavans / DSR
☆98Updated 8 years ago
vicariousinc / schema-games
General Game Playing with Schema Networks
☆41Updated 2 years ago
sudeepraja / Model-Free-Episodic-Control
Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460
☆56Updated 8 years ago
ethancaballero / paper-notes
ML/DL/RL paper notes
☆21Updated 5 years ago
yobibyte / atarigrandchallenge
Code for 'The Grand Atari Challenge dataset' paper
☆52Updated 6 years ago