Surprise-based intrinsic motivation for deep reinforcement learning
☆21Mar 6, 2017Updated 9 years ago
Alternatives and similar repositories for surprise
Users that are interested in surprise are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experiments from "The Description Length of Deep Learning Models"☆10Aug 1, 2018Updated 7 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Oct 31, 2018Updated 7 years ago
- Models built with TensorFlow☆26Dec 5, 2018Updated 7 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- An experiment with Thompson sampling and TD(0) on a grid world variant☆17Nov 8, 2013Updated 12 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- ☆10Dec 9, 2021Updated 4 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- High granularity and accuracy Starcraft replay data extractor which outputs to a database☆14Feb 18, 2022Updated 4 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 7 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Jun 28, 2019Updated 6 years ago
- A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interface☆33Dec 23, 2016Updated 9 years ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)"☆17Dec 17, 2019Updated 6 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Learning from Trajectories via Subgoal Discovery☆12Dec 10, 2020Updated 5 years ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Apr 30, 2018Updated 7 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 7 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆38Jan 8, 2017Updated 9 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- Let Pydantic and Shapely work together!☆18Jan 27, 2026Updated last month
- NIPS2017 challenge☆49Oct 7, 2018Updated 7 years ago
- A reinforcement learning framework☆157Dec 26, 2018Updated 7 years ago
- Implementation of Deep Q-learning from Demonstrations using Keras and a Retro Gym environment.☆14Jul 16, 2018Updated 7 years ago
- ☆12Oct 5, 2020Updated 5 years ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆349Nov 22, 2018Updated 7 years ago
- A Chainer implementation of WGAN-GP.☆12Oct 4, 2017Updated 8 years ago
- ☆10Nov 13, 2019Updated 6 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- A simple Gridworld environment for Open AI gym☆25Jun 10, 2018Updated 7 years ago
- Reinforcement learning algorithms for OSU's bipedal robot - Cassie☆13Feb 8, 2018Updated 8 years ago
- Presentation on Human-Level Control Through Deep Reinforcement Learning☆13Feb 28, 2016Updated 10 years ago