petrosgk / MalmoRL
A framework for training Reinforcement Learning agents in Minecraft with Project Malmö
☆19Updated 6 years ago
Alternatives and similar repositories for MalmoRL:
Users that are interested in MalmoRL are comparing it to the libraries listed below
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Updated 7 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- Implementation of Neural Episodic Control in Tensorflow☆26Updated 5 years ago
- Modular multitask reinforcement learning with policy sketches☆106Updated 3 years ago
- tensorflow deep RL hacking on minecraft with malmo☆54Updated 8 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 8 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆92Updated 6 years ago
- Meta Reinforcement Learning Experiments☆33Updated 7 years ago
- Imagination Augmented Agents TensorFlow☆26Updated 4 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 7 years ago
- Implementation of Deepmind's Neural Episodic Control☆59Updated 6 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- A simple Gridworld environment for Open AI gym☆24Updated 6 years ago
- ☆22Updated 6 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆70Updated 7 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 7 years ago
- ☆46Updated 6 years ago
- Gym - Doom environments based on VizDoom.☆102Updated 7 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- Proximal Policy Optimization in PyTorch☆38Updated 7 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760☆24Updated 5 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 6 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago