clementbernardd / Count-Based-ExplorationLinks

Our version of #Exploration: A Study of Count-Based Explorationfor Deep Reinforcement Learning for a class project

☆15

Alternatives and similar repositories for Count-Based-Exploration

Users that are interested in Count-Based-Exploration are comparing it to the libraries listed below

Sorting:

tesslerc / ActionRobustRL
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…
☆46Updated 6 years ago
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44Updated 4 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆51Updated 2 years ago
wyjung0625 / p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
☆22Updated 5 years ago
karush17 / esac
Evolution-based Soft Actor-Critic (ESAC)
☆42Updated last year
marcbrittain / Prioritized-Sequence-Experience-Replay
Prioritized Sequence Experience Replay
☆10Updated 3 years ago
HumanCompatibleAI / population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28Updated 6 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 4 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65Updated 4 months ago
robintyh1 / onpolicybaselines
on-policy optimization baselines for deep reinforcement learning
☆30Updated 5 years ago
danielwillemsen / MAMBPO
DecentralizedLearning
☆24Updated 2 years ago
chanb / metalearning_RL
☆20Updated 2 years ago
ermongroup / multiagent-gail
☆84Updated 6 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆27Updated 5 years ago
wendelinboehmer / dcg
☆75Updated last year
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
gjp1203 / nui_in_madrl
Negative Update Intervals in Multi-Agent Deep Reinforcement Learning
☆33Updated 6 years ago
IouJenLiu / PIC
PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning
☆49Updated 4 years ago
jqueeney / geppo
Generalized Proximal Policy Optimization with Sample Reuse (GePPO)
☆25Updated 2 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
manantomar / Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
☆38Updated 4 years ago
liuzuxin / safe-mbrl
Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method
☆66Updated 2 years ago
AlgTUDelft / AlwaysSafe
Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"
☆17Updated 3 years ago
cap-ntu / baconian-project
Model-based Reinforcement Learning Framework
☆114Updated 5 years ago
WilsonWangTHU / POPLIN
☆99Updated 2 years ago
uoe-agents / derl
The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆27Updated 3 years ago
chenhongge / StateAdvDRL
[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
☆134Updated 3 years ago
sjtu-marl / bd_rd_psro
Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
☆20Updated 3 years ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆40Updated 6 years ago