orrivlin / MountainCar_DQN_RNDLinks

Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)

☆42

Alternatives and similar repositories for MountainCar_DQN_RND

Users that are interested in MountainCar_DQN_RND are comparing it to the libraries listed below

Sorting:

alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51Updated 2 months ago
Coac / never-give-up
PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies
☆59Updated 4 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆142Updated 6 years ago
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆138Updated 2 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆96Updated 5 years ago
chagmgang / pytorch_ppo_rl
Pytorch implementation of intrinsic curiosity module with proximal policy optimization
☆53Updated 6 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130Updated 11 months ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
navneet-nmk / Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
☆60Updated 6 years ago
createamind / DRL
☆92Updated 4 years ago
kandouss / marlgrid
Gridworld for MARL experiments
☆141Updated 4 years ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆123Updated 4 years ago
rpatrik96 / AttA2C
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
☆27Updated 5 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50Updated last week
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆178Updated 11 months ago
jcwleo / random-network-distillation-pytorch
Random Network Distillation pytorch
☆252Updated 6 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆26Updated 5 years ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆103Updated 3 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆52Updated 2 years ago
wendelinboehmer / dcg
☆76Updated last year
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
younggyoseo / pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
☆45Updated 6 years ago
tjuHaoXiaotian / GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
☆31Updated 6 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆119Updated 8 months ago
TonghanWang / ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
☆160Updated 2 years ago
jerrodparker20 / adaptive-transformers-in-rl
Adaptive Attention Span for Reinforcement Learning
☆133Updated 5 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆133Updated last week
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆106Updated 6 years ago
koulanurag / minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
☆56Updated 3 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year