Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
☆206 · Nov 22, 2018 · Updated 7 years ago
Alternatives and similar repositories for atari-reset
Users who are interested in atari-reset are comparing it to the libraries listed below
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration" ☆33 · Nov 22, 2018 · Updated 7 years ago
- ICML 2018 Self-Imitation Learning ☆278 · Apr 18, 2020 · Updated 5 years ago
- Code for the paper "Exploration by Random Network Distillation" ☆930 · Oct 1, 2020 · Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 · Sep 13, 2019 · Updated 6 years ago
- Code for the paper "Evolved Policy Gradients" ☆253 · Nov 22, 2018 · Updated 7 years ago
- Publicly releasable baselines for the Retro contest ☆129 · Nov 22, 2018 · Updated 7 years ago
- The state-of-the-art deep RL algorithms for Montezuma's Revenge ☆28 · Oct 28, 2018 · Updated 7 years ago
- ☆119 · Jul 9, 2020 · Updated 5 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces" ☆19 · Oct 26, 2018 · Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package ☆268 · Oct 24, 2019 · Updated 6 years ago
- Code for the paper "Emergent Complexity via Multi-agent Competition" ☆828 · Apr 2, 2023 · Updated 2 years ago
- Code for the paper "Quantifying Transfer in Reinforcement Learning" ☆408 · Oct 7, 2023 · Updated 2 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆95 · Jul 27, 2022 · Updated 3 years ago
- ☆29 · Jun 23, 2018 · Updated 7 years ago
- [IJCAI'20][ICLR'19 Workshop] Flow-based Intrinsic Curiosity Module. Playing SuperMario with RL agent and FICM! ☆104 · Dec 8, 2022 · Updated 3 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks" ☆348 · Nov 22, 2018 · Updated 7 years ago
- ☆160 · Jul 21, 2017 · Updated 8 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆436 · Nov 28, 2023 · Updated 2 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments" ☆309 · Apr 13, 2023 · Updated 2 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity" ☆117 · Dec 13, 2019 · Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- TensorFlow/Keras code and trained models for Episodic Curiosity Through Reachability ☆205 · Oct 2, 2020 · Updated 5 years ago
- Chef cookbooks for managing a Ceph cluster ☆11 · Apr 2, 2023 · Updated 2 years ago
- Code for hierarchical imitation learning and reinforcement learning ☆301 · Mar 14, 2018 · Updated 7 years ago
- Code for the paper "Large-Scale Study of Curiosity-Driven Learning" ☆830 · Aug 12, 2021 · Updated 4 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework. ☆12 · Apr 2, 2023 · Updated 2 years ago
- Fluentd output plugin that sends events to Amazon Kinesis Streams and Amazon Kinesis Firehose. ☆12 · Apr 2, 2023 · Updated 2 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆52 · Oct 26, 2020 · Updated 5 years ago
- Code for Go-Explore: a New Approach for Hard-Exploration Problems ☆581 · Dec 8, 2022 · Updated 3 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures. ☆1,019 · Mar 13, 2019 · Updated 6 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆155 · Sep 22, 2017 · Updated 8 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics ☆374 · Oct 15, 2021 · Updated 4 years ago
- NIPS 2017 Value Prediction Network ☆167 · Jan 12, 2018 · Updated 8 years ago
- Inferring beliefs about dynamics from behavior ☆30 · May 24, 2018 · Updated 7 years ago
- Ranking Policy Gradient ☆23 · Nov 27, 2019 · Updated 6 years ago
- Value Iteration Networks ☆291 · Apr 21, 2017 · Updated 8 years ago
- [ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration ☆203 · May 30, 2018 · Updated 7 years ago
- A TensorFlow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆32 · Oct 12, 2017 · Updated 8 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms. ☆378 · Nov 19, 2022 · Updated 3 years ago