njustesen / a2c_gvgai
A2C for GVG-AI
☆21 Updated 6 years ago
Alternatives and similar repositories for a2c_gvgai
Users who are interested in a2c_gvgai are comparing it to the repositories listed below.
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration" ☆21 Updated 7 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments" ☆29 Updated 2 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" ☆18 Updated 7 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆52 Updated 4 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning ☆29 Updated 7 years ago
- Code accompanying the OptionGAN paper. ☆44 Updated 6 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu… ☆12 Updated 4 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments ☆16 Updated 6 years ago
- The Variational Homoencoder: Learning to learn high capacity generative models from few examples ☆34 Updated 2 years ago
- Ranking Policy Gradient ☆23 Updated 5 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization ☆37 Updated 7 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces" ☆19 Updated 6 years ago
- RL framework for embodied agents based on PyTorch ☆12 Updated 6 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆32 Updated 6 years ago
- This is my implementation of the Optimality Tightening ☆37 Updated 8 years ago
- Tensorflow Implementation of Programmable Agents ☆35 Updated 7 years ago
- ☆35 Updated 6 years ago
- Contextual Bandits Action Elimination DQN ☆21 Updated 7 years ago
- Model-Free Episodic Control ☆14 Updated 8 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm ☆25 Updated 2 years ago
- Efficient zero-shot goal planning ☆13 Updated 7 years ago
- Reinforcement learning algorithm implementations and ML experimentation workspace ☆43 Updated 6 years ago
- Random memory adaptation model inspired by the paper: "Memory-based parameter adaptation (MbPA)" ☆24 Updated 7 years ago
- Code for a generative controller for the AI Gym cartpole task ☆15 Updated 8 years ago
- Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks ☆40 Updated 5 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future ☆50 Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 Updated 6 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆31 Updated 6 years ago
- ☆15 Updated 8 years ago
- Reward Learning by Simulating the Past ☆44 Updated 6 years ago