Exploration by Random Network Distillation
☆15Dec 30, 2018Updated 7 years ago
Alternatives and similar repositories for rnd
Users that are interested in rnd are comparing it to the libraries listed below.
- The open source of FeverBasketball environment for research purpose.☆11Mar 2, 2020Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Rank TD: End-to-End Robotic Reinforcement Learning without Reward Engineering and Demonstrations☆14Oct 8, 2022Updated 3 years ago
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago
- Learning from Trajectories via Subgoal Discovery☆12Dec 10, 2020Updated 5 years ago
- DDPG on OpenAI Gym Pendulum☆17Jul 1, 2016Updated 9 years ago
- ☆13Nov 17, 2015Updated 10 years ago
- ☆13Apr 3, 2019Updated 7 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- ForgER algorithm☆23Oct 3, 2022Updated 3 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge)☆14Jul 23, 2018Updated 7 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆95Jul 27, 2022Updated 3 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- A uniform collection of API bindings for various cryptocurrency exchanges☆13Mar 14, 2018Updated 8 years ago
- ☆12Feb 20, 2021Updated 5 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Setup for Octo and some experiments with the model☆12Apr 11, 2024Updated 2 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- Python implementation of state-of-art meta-heuristic and evolutionary optimization algorithms.☆12Jun 29, 2022Updated 3 years ago
- Implementation of Few-shot Binary Image Classification using Contrastive Learning-based Approach in PyTorch☆11May 1, 2023Updated 3 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆204Oct 2, 2020Updated 5 years ago
- A collection of papers on divergence and quality diversity☆79Aug 12, 2022Updated 3 years ago
- Co-training for Policy Learning☆13Aug 8, 2019Updated 6 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆20Mar 10, 2021Updated 5 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- Quandl.com API implementation in Haskell☆17Jul 3, 2021Updated 4 years ago
- Hindsight policy gradients☆46Jan 31, 2020Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Continuous Energy Minimization for Multitarget Tracking☆20Feb 9, 2022Updated 4 years ago
- ☆21Jul 14, 2020Updated 5 years ago
- Code for the Reset-free Trial and Error learning paper (RTE) experiments☆10Jan 3, 2018Updated 8 years ago
- Algorithmic Music Composition☆11Aug 28, 2018Updated 7 years ago
- ☆25Dec 8, 2022Updated 3 years ago
- ☆29Apr 16, 2021Updated 5 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 4 years ago
- Code companion of Multi-task Learning for Aggregated Data using Gaussian Processes paper☆11Apr 6, 2020Updated 6 years ago