Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆55May 12, 2025Updated 11 months ago
Alternatives and similar repositories for PPO-RND
Users that are interested in PPO-RND are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆57Nov 10, 2025Updated 5 months ago
- Random Network Distillation pytorch☆261Mar 4, 2019Updated 7 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- Our version of #Exploration: A Study of Count-Based Explorationfor Deep Reinforcement Learning for a class project☆16Apr 30, 2021Updated 5 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Baseline implementation of recurrent PPO using truncated BPTT☆161Apr 28, 2024Updated 2 years ago
- Random Network Distillation(RND) algo in Pytorch☆51Feb 26, 2019Updated 7 years ago
- ☆30Jan 27, 2025Updated last year
- V-MPO torch version with DMLab30 and GTrXL☆13Mar 1, 2021Updated 5 years ago
- clear single-file JAX implementations of common RL algorithms☆16Sep 5, 2021Updated 4 years ago
- Docker-based, gym-like torcs environment with vision.☆19Apr 18, 2022Updated 4 years ago
- A2C is a special case of PPO!☆22May 20, 2022Updated 3 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- ☆16Aug 2, 2022Updated 3 years ago
- Code for the paper "Exploration by Random Network Distillation"☆932Oct 1, 2020Updated 5 years ago
- A collection of RL algorithms written in JAX.☆105Jul 5, 2022Updated 3 years ago
- Neural Network Genetic Algorithm library used for deep learning problems☆18Jun 2, 2021Updated 4 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆28Oct 28, 2018Updated 7 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld☆13Jul 13, 2020Updated 5 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆59Jan 22, 2021Updated 5 years ago
- Posted at AAAI 2023☆11Sep 4, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- CartPole-v0 via PPO with GAE, PyTorch☆21Feb 10, 2019Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 5 years ago
- Collection of resources on plasticity loss in deep reinforcement learning☆23Nov 12, 2024Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆210Jun 18, 2024Updated last year
- Adaptive Risk Tendency Implicit Quantile Network for Drone Navigation under Partial Observability.☆38Mar 29, 2022Updated 4 years ago
- Visualisation of MCTS in Unity with C# for different games, being created for my third year university project at the University of York☆16Jun 12, 2018Updated 7 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization"☆60Apr 2, 2023Updated 3 years ago
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆26Nov 30, 2023Updated 2 years ago
- ☆19Mar 28, 2019Updated 7 years ago
- Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.☆10Nov 30, 2021Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆264Nov 23, 2025Updated 5 months ago