Surprise-based intrinsic motivation for deep reinforcement learning
☆21Mar 6, 2017Updated 9 years ago
Alternatives and similar repositories for surprise
Users that are interested in surprise are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experiments from "The Description Length of Deep Learning Models"☆10Aug 1, 2018Updated 7 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Oct 31, 2018Updated 7 years ago
- Deep PILCO PyTorch Implementation☆15Mar 25, 2023Updated 3 years ago
- Design good curriculums for deep reinforcement learning☆14May 18, 2016Updated 9 years ago
- Models built with TensorFlow☆26Dec 5, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆160Jul 21, 2017Updated 8 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- An experiment with Thompson sampling and TD(0) on a grid world variant☆17Nov 8, 2013Updated 12 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 8 years ago
- A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interface☆33Dec 23, 2016Updated 9 years ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)"☆17Dec 17, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Learning from Trajectories via Subgoal Discovery☆12Dec 10, 2020Updated 5 years ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Apr 30, 2018Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 8 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆38Jan 8, 2017Updated 9 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- Conditional Random Fields implemented as Lasagne layer☆10Jul 22, 2016Updated 9 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆134May 5, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Dec 13, 2018Updated 7 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- Repo for a generalised DQN Agent model capable of solving major discrete action space control problems☆18Aug 20, 2018Updated 7 years ago
- 高雄 python 社群活動整理☆10Apr 5, 2019Updated 7 years ago
- Contains code relating to this arxiv paper https://arxiv.org/abs/1802.03761☆37Mar 19, 2018Updated 8 years ago
- NIPS2017 challenge☆49Oct 7, 2018Updated 7 years ago
- A reinforcement learning framework☆157Dec 26, 2018Updated 7 years ago
- Implementation of Deep Q-learning from Demonstrations using Keras and a Retro Gym environment.☆14Jul 16, 2018Updated 7 years ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Oct 5, 2020Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- ☆10Nov 13, 2019Updated 6 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- A simple Gridworld environment for Open AI gym☆25Jun 10, 2018Updated 7 years ago
- Reinforcement learning algorithms for OSU's bipedal robot - Cassie☆13Feb 8, 2018Updated 8 years ago