matpalm / malmomo
tensorflow deep RL hacking on minecraft with malmo
☆54Updated 8 years ago
Alternatives and similar repositories for malmomo:
Users that are interested in malmomo are comparing it to the libraries listed below
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 7 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 8 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆42Updated 6 years ago
- Model-Free Episodic Control☆14Updated 8 years ago
- ☆32Updated 7 years ago
- Malmo Collaborative AI Challenge - Team Pig Catcher☆65Updated 7 years ago
- This is my implementation of the Optimality Tightening☆37Updated 7 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 6 years ago
- Reading Group on Reinforcement Learning topics☆55Updated 8 years ago
- Reinforcement learning environments for Torch7☆93Updated 8 years ago
- [deprecated] Bridge from Gym to ROS robots☆74Updated last year
- ML/DL/RL paper notes☆20Updated 6 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"☆114Updated 9 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- Deterministic Policy Gradient using torch7☆43Updated 8 years ago
- Neural Task Programming☆81Updated 6 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Updated 7 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- ☆19Updated 8 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 6 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 8 years ago
- ☆100Updated 8 years ago
- ☆24Updated 9 years ago
- Tensorflow Implementation of Programmable Agents☆35Updated 7 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated last year
- Train an RL agent to play multiple Atari games at once☆69Updated 8 years ago
- ☆38Updated 8 years ago
- third person imitation learning. Archival only.☆76Updated 5 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Updated 7 years ago