Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆55May 12, 2025Updated last year
Alternatives and similar repositories for PPO-RND
Users who are interested in PPO-RND are comparing it to the libraries listed below.
- Random Network Distillation pytorch☆262Mar 4, 2019Updated 7 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- Our version of #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning for a class project☆16Apr 30, 2021Updated 5 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆161Apr 28, 2024Updated 2 years ago
- Random Network Distillation(RND) algo in Pytorch☆51Feb 26, 2019Updated 7 years ago
- ☆30Jan 27, 2025Updated last year
- V-MPO torch version with DMLab30 and GTrXL☆13Mar 1, 2021Updated 5 years ago
- clear single-file JAX implementations of common RL algorithms☆15Sep 5, 2021Updated 4 years ago
- Docker-based, gym-like torcs environment with vision.☆19Apr 18, 2022Updated 4 years ago
- A2C is a special case of PPO!☆22May 20, 2022Updated 4 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- ☆16Aug 2, 2022Updated 3 years ago
- Code for the paper "Exploration by Random Network Distillation"☆933Oct 1, 2020Updated 5 years ago
- A collection of RL algorithms written in JAX.☆105Jul 5, 2022Updated 3 years ago
- Neural Network Genetic Algorithm library used for deep learning problems☆18Jun 2, 2021Updated 4 years ago
- The state-of-the-art deep RL algorithms for Montezuma's Revenge☆28Oct 28, 2018Updated 7 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld☆13Jul 13, 2020Updated 5 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆59Jan 22, 2021Updated 5 years ago
- Posted at AAAI 2023☆11Sep 4, 2025Updated 8 months ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 6 months ago
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- CartPole-v0 via PPO with GAE, PyTorch☆21Feb 10, 2019Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 6 years ago
- Collection of resources on plasticity loss in deep reinforcement learning☆23Nov 12, 2024Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆210Jun 18, 2024Updated last year
- Visualisation of MCTS in Unity with C# for different games, being created for my third year university project at the University of York☆16Jun 12, 2018Updated 7 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 3 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- PyTorch Implementation of Visual GAIL in Atari Games☆14Dec 7, 2022Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization"☆60Apr 2, 2023Updated 3 years ago
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆26Nov 30, 2023Updated 2 years ago
- ☆19Mar 28, 2019Updated 7 years ago
- Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.☆10Nov 30, 2021Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆266Nov 23, 2025Updated 6 months ago