sermanet / rewards
Unsupervised Perceptual Rewards for Imitation Learning
☆12Updated 6 years ago
Related projects
Alternatives and complementary repositories for rewards
- ☆15Updated 7 years ago
- ☆15Updated 4 years ago
- ☆13Updated 6 years ago
- ☆19Updated 6 years ago
- Visual Transition State Clustering☆13Updated 6 years ago
- Model-Free Episodic Control☆15Updated 7 years ago
- Implementation of Residual Learning with Stochastic Depth http://arxiv.org/pdf/1603.09382v2.pdf ☆10Updated 8 years ago
- This repository implements the paper "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks".☆16Updated 7 years ago
- Generalized Compressed Network Search with PyTorch☆26Updated 7 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"☆29Updated 2 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Updated 7 years ago
- ☆13Updated 7 years ago
- DiDi-Udacity Self-Driving Car Challenge 2017 Raw Data Reader☆11Updated 7 years ago
- Deep reinforcement learning package for torch7☆16Updated 8 years ago
- a library for deep reinforcement learning, with applications for navigation☆16Updated 6 years ago
- Unsupervised instance segmentation via active robot interaction☆77Updated 2 years ago
- ☆12Updated 8 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 7 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Updated 7 years ago
- Code for the blog post on few-shot classification via task representation and communication.☆18Updated 7 years ago
- ☆10Updated 8 years ago
- An implementation of BiternionNets for ROS, ready to run on a robot.☆13Updated 6 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Updated 7 years ago
- Models built with TensorFlow☆25Updated 5 years ago
- ☆15Updated 8 years ago
- A very simple variant of adversarial training that yields excellent results on MNIST☆12Updated 8 years ago