Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆55May 12, 2025Updated 11 months ago
Alternatives and similar repositories for PPO-RND
Users that are interested in PPO-RND are comparing it to the libraries listed below.
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆56Nov 10, 2025Updated 5 months ago
- Random Network Distillation pytorch☆261Mar 4, 2019Updated 7 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- Our version of #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning for a class project☆16Apr 30, 2021Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆160Apr 28, 2024Updated last year
- Random Network Distillation(RND) algo in Pytorch☆51Feb 26, 2019Updated 7 years ago
- ☆29Jan 27, 2025Updated last year
- V-MPO torch version with DMLab30 and GTrXL☆13Mar 1, 2021Updated 5 years ago
- clear single-file JAX implementations of common RL algorithms☆16Sep 5, 2021Updated 4 years ago
- Docker-based, gym-like torcs environment with vision.☆20Apr 18, 2022Updated 3 years ago
- A2C is a special case of PPO!☆22May 20, 2022Updated 3 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- ☆16Aug 2, 2022Updated 3 years ago
- Code for the paper "Exploration by Random Network Distillation"☆932Oct 1, 2020Updated 5 years ago
- A collection of RL algorithms written in JAX.☆105Jul 5, 2022Updated 3 years ago
- Neural Network Genetic Algorithm library used for deep learning problems☆18Jun 2, 2021Updated 4 years ago
- The state-of-the-art deep RL algorithms for Montezuma's Revenge☆28Oct 28, 2018Updated 7 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆58Jan 22, 2021Updated 5 years ago
- Posted at AAAI 2023☆11Sep 4, 2025Updated 7 months ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 5 months ago
- CartPole-v0 via PPO with GAE, PyTorch☆21Feb 10, 2019Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 5 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆209Jun 18, 2024Updated last year
- Adaptive Risk Tendency Implicit Quantile Network for Drone Navigation under Partial Observability.☆37Mar 29, 2022Updated 4 years ago
- Visualisation of MCTS in Unity with C# for different games, being created for my third year university project at the University of York☆15Jun 12, 2018Updated 7 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- PyTorch Implementation of Visual GAIL in Atari Games☆14Dec 7, 2022Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization"☆60Apr 2, 2023Updated 3 years ago
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆26Nov 30, 2023Updated 2 years ago
- ☆18Mar 28, 2019Updated 7 years ago
- Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.☆10Nov 30, 2021Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- JAX implementations of various deep reinforcement learning algorithms.☆25Feb 2, 2025Updated last year
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆89Nov 22, 2017Updated 8 years ago