DavidJanz / successor_uncertainties_atariView external linksLinks
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemysław Mazur, Katja Hofmann, José Miguel Hernández-Lobato, Sebastian Tschiatschek. NeurIPS 2019. *Equal contribution
☆21Feb 24, 2023Updated 2 years ago
Alternatives and similar repositories for successor_uncertainties_atari
Users that are interested in successor_uncertainties_atari are comparing it to the libraries listed below
Sorting:
- ☆31Jul 1, 2019Updated 6 years ago
- Randomized Value Functions via Multiplicative Normalizing Flows☆17Jan 1, 2023Updated 3 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 7 years ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 4 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Oct 20, 2023Updated 2 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆128Jun 11, 2019Updated 6 years ago
- Reward Propagation using Graph Convolutional Networks☆13Jun 19, 2021Updated 4 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- path finding algorithms☆17Apr 17, 2024Updated last year
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 4 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆24Apr 14, 2022Updated 3 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Sep 17, 2020Updated 5 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.☆17Feb 17, 2021Updated 4 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Jan 21, 2021Updated 5 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Apr 28, 2019Updated 6 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- ☆24Apr 16, 2024Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 2 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆97Jun 19, 2025Updated 7 months ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆103Jun 22, 2022Updated 3 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Jul 31, 2020Updated 5 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆67Oct 3, 2023Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 6 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago