njustesen / a2c_gvgai
A2C for GVG-AI
☆22 Updated 7 years ago
Alternatives and similar repositories for a2c_gvgai
Users who are interested in a2c_gvgai are comparing it to the libraries listed below.
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration" ☆21 Updated 7 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆52 Updated 5 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" ☆18 Updated 8 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments" ☆29 Updated 3 years ago
- ☆35 Updated 7 years ago
- Code accompanying the OptionGAN paper. ☆44 Updated 7 years ago
- RL framework for embodied agents based on PyTorch ☆11 Updated 6 years ago
- This is my implementation of the Optimality Tightening ☆37 Updated 8 years ago
- E2C implementation in PyTorch ☆43 Updated 8 years ago
- SeqGAN but with more bells and whistles ☆24 Updated 7 years ago
- Reward Learning by Simulating the Past ☆46 Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning. ☆30 Updated 5 years ago
- On the pitfalls of measuring emergent communication ☆34 Updated 6 years ago
- Reinforcement learning algorithm implementations and ML experimentation workspace ☆43 Updated 6 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆32 Updated 7 years ago
- Distributed A3C ☆34 Updated 8 years ago
- Separating value functions across time-scales. ☆17 Updated 6 years ago
- Tensorflow Implementation of Programmable Agents ☆35 Updated 8 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm ☆24 Updated 2 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning. ☆68 Updated 7 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆81 Updated 8 years ago
- Ranking Policy Gradient ☆23 Updated 6 years ago
- Distributed implementation of popular evolutionary methods ☆64 Updated 8 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization ☆37 Updated 7 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents. ☆70 Updated 8 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces" ☆19 Updated 7 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper) ☆80 Updated 8 years ago
- Our NIPS 2017: Learning to Run source code ☆55 Updated 2 years ago
- Robust policy search algorithms which train on model ensembles ☆30 Updated 9 years ago
- ☆44 Updated 7 years ago