alirezakazemipour/PPO-RND

Random network distillation on Montezuma's Revenge and Super Mario Bros.

☆55

Alternatives and similar repositories for PPO-RND

Users that are interested in PPO-RND are comparing it to the libraries listed below.

wisnunugroho21 / reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…
☆57 · Nov 10, 2025 · Updated 7 months ago
jcwleo / random-network-distillation-pytorch
Random Network Distillation pytorch
☆262 · Mar 4, 2019 · Updated 7 years ago
orrivlin / MountainCar_DQN_RND
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
☆41 · Jan 28, 2019 · Updated 7 years ago
clementbernardd / Count-Based-Exploration
Our version of #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning for a class project
☆16 · Apr 30, 2021 · Updated 5 years ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆30 · Dec 14, 2021 · Updated 4 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆161 · Apr 28, 2024 · Updated 2 years ago
wizdom13 / RND-Pytorch
Random Network Distillation(RND) algo in Pytorch
☆51 · Feb 26, 2019 · Updated 7 years ago
younggyoseo / CQN-AS
☆30 · Jan 27, 2025 · Updated last year
jsikyoon / V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
☆13 · Mar 1, 2021 · Updated 5 years ago
gebob19 / rl_with_jax
clear single-file JAX implementations of common RL algorithms
☆15 · Sep 5, 2021 · Updated 4 years ago
gerkone / pyTORCS-docker
Docker-based, gym-like torcs environment with vision.
☆19 · Apr 18, 2022 · Updated 4 years ago
vwxyzjn / a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
☆23 · May 20, 2022 · Updated 4 years ago
yun-kwak / decision-transformer-jax
Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku
☆13 · Aug 14, 2024 · Updated last year
luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆12 · Jun 15, 2023 · Updated 2 years ago
vwxyzjn / gym-pysc2
Gym wrapper for pysc2
☆10 · Sep 16, 2022 · Updated 3 years ago
lmzintgraf / hyperx
☆16 · Aug 2, 2022 · Updated 3 years ago
openai / random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
☆933 · Oct 1, 2020 · Updated 5 years ago
rssalessio / nnGA
Neural Network Genetic Algorithm library used for deep learning problems
☆18 · Jun 2, 2021 · Updated 5 years ago
reinforcement-learning-kr / rl-montezuma
The state-of-art deep rl algorithms for Montezuma's revenge
☆28 · Oct 28, 2018 · Updated 7 years ago
mingen-pan / Reinforcement-Learning-Q-learning-Gridworld-Pytorch
This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld
☆14 · Jul 13, 2020 · Updated 5 years ago
Coac / never-give-up
PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies
☆58 · Jan 22, 2021 · Updated 5 years ago
raincchio / P3O
Posted at AAAI 2023
☆11 · Sep 4, 2025 · Updated 9 months ago
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆47 · Sep 16, 2021 · Updated 4 years ago
4kasha / CartPole_PPO
CartPole-v0 via PPO with GAE, PyTorch
☆21 · Feb 10, 2019 · Updated 7 years ago
LAMDA-RL / ImagineBench
A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.
☆14 · Nov 4, 2025 · Updated 7 months ago
chscheller / minerl_agent
3rd placed submission to the NeurIPS MineRL competition 2019
☆10 · Mar 24, 2023 · Updated 3 years ago
yining043 / SAC-discrete
Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.
☆20 · May 18, 2020 · Updated 6 years ago
Probabilistic-and-Interactive-ML / awesome-plasticity-loss
Collection of resources on plasticity loss in deep reinforcement learning
☆23 · Nov 12, 2024 · Updated last year
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆209 · Jun 18, 2024 · Updated last year
tudelft / risk-sensitive-rl
Adaptive Risk Tendency Implicit Quantile Network for Drone Navigation under Partial Observability.
☆38 · Mar 29, 2022 · Updated 4 years ago
suyoung-lee / LDM
Latent Dynamics Mixture, NeurIPS 2021
☆18 · Oct 25, 2022 · Updated 3 years ago
yunke-wang / gail_atari
PyTorch Implementation of Visual GAIL in Atari Games
☆14 · Dec 7, 2022 · Updated 3 years ago
openai / ppo-ewma
Code for the paper "Batch size invariance for policy optimization"
☆60 · Apr 2, 2023 · Updated 3 years ago
allenai / prior
🐍 A Python Package for Seamless Data Distribution in AI Workflows
☆26 · Nov 30, 2023 · Updated 2 years ago
gabrieledcjr / DeepRL
☆19 · Mar 28, 2019 · Updated 7 years ago
instance01 / fish-rl-alife
Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.
☆10 · Nov 30, 2021 · Updated 4 years ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25 · Jul 18, 2023 · Updated 2 years ago
MichalBortkiewicz / JaxGCRL
Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.
☆269 · Jun 6, 2026 · Updated last week
hamishs / JAX-RL
JAX implementations of various deep reinforcement learning algorithms.
☆25 · Feb 2, 2025 · Updated last year