Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
Alternatives and similar repositories for EMI
Users that are interested in EMI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 7 years ago
- Implementation of Data Efficient Reinforcement Learning in Pytorch☆20Aug 6, 2019Updated 6 years ago
- Repository for our ICML 2019 paper: Curiosity-Bottleneck☆34Nov 21, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Proto-RL: Reinforcement Learning with Prototypical Representations☆86Jun 12, 2022Updated 3 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- ☆141Feb 26, 2019Updated 7 years ago
- impact-driven-exploration☆133Oct 3, 2023Updated 2 years ago
- ☆18Jun 8, 2023Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- ☆14May 9, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 7 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆256May 3, 2020Updated 6 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- ☆40Nov 23, 2021Updated 4 years ago
- "Detecting Extrapolation with Local Ensembles" by David Madras, James Atwood, and Alex D'Amour☆13Sep 25, 2020Updated 5 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆94Jul 25, 2024Updated last year
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆59Jan 22, 2021Updated 5 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Mar 9, 2023Updated 3 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆31Jun 15, 2020Updated 5 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- Minimizing Control for Credit Assignment with Strong Feedback☆14Nov 3, 2024Updated last year
- RRT NBV Exploration☆15Mar 4, 2023Updated 3 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆70Aug 11, 2023Updated 2 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Jan 21, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- ☆47Dec 8, 2022Updated 3 years ago
- Project structure of Deep Learning experiments☆12Jan 20, 2018Updated 8 years ago
- BASALT Benchmark datasets, evaluation code and agent training example.☆22Nov 29, 2023Updated 2 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).☆21Mar 4, 2023Updated 3 years ago