mcmachado/count_based_exploration_sr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mcmachado/count_based_exploration_sr)

mcmachado / count_based_exploration_sr

☆31

Alternatives and similar repositories for count_based_exploration_sr

Users that are interested in count_based_exploration_sr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
DavidJanz / successor_uncertainties_atari
View on GitHub
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Feb 24, 2023Updated 3 years ago
oxwhirl / opiq
View on GitHub
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14Aug 4, 2020Updated 5 years ago
greatwallet / mountain-car
View on GitHub
A simple baseline for mountain-car @ gym
☆12Jan 15, 2020Updated 6 years ago
mcmachado / b-pro
View on GitHub
☆21May 31, 2019Updated 7 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
mingen-pan / Reinforcement-Learning-Q-learning-Gridworld-Pytorch
View on GitHub
This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld
☆14Jul 13, 2020Updated 6 years ago
seungyulhan / disc
View on GitHub
☆10Aug 17, 2022Updated 3 years ago
rahul13ramesh / SuccessorOptions
View on GitHub
Successor Options is an option discovery framework for Reinforcement Learning
☆14Jun 17, 2024Updated 2 years ago
rpatrik96 / AttA2C
View on GitHub
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
☆29Nov 27, 2019Updated 6 years ago
abbyvansoest / maxent
View on GitHub
☆14May 30, 2019Updated 7 years ago
holarissun / PCHID_code
View on GitHub
Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
☆15Jan 7, 2020Updated 6 years ago
ubisoft / ubisoft-laforge-asaf
View on GitHub
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
☆16Dec 10, 2020Updated 5 years ago
erwanbou / sf-deep-rl
View on GitHub
Project on Successor Features in Deep Reinforcement Learning and Transfer Learning
☆24Feb 5, 2018Updated 8 years ago
jinnaiyuu / Optimal-Options-ICML-2019
View on GitHub
Code for generating options for planning and reinforcement learning
☆12Feb 18, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tuomaso / radial_rl
View on GitHub
Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"
☆33Oct 3, 2023Updated 2 years ago
joeybose / FloRL
View on GitHub
Implicit Normalizing Flows + Reinforcement Learning
☆62May 31, 2019Updated 7 years ago
kristychoi / pixel_exploration
View on GitHub
PyTorch implementation of Count-Based Exploration with Neural Density Models
☆10Mar 22, 2018Updated 8 years ago
robintyh1 / onpolicybaselines
View on GitHub
on-policy optimization baselines for deep reinforcement learning
☆32Apr 3, 2020Updated 6 years ago
tianjunz / NovelD
View on GitHub
☆40Nov 23, 2021Updated 4 years ago
flowersteam / geppg
View on GitHub
☆36Aug 10, 2018Updated 7 years ago
MouseHu / GEM
View on GitHub
☆16Jul 1, 2021Updated 5 years ago
mklissa / PPOC
View on GitHub
Proximal Policy Option-Critic
☆26Jan 4, 2019Updated 7 years ago
astier / model-free-episodic-control
View on GitHub
Model-Free-Episodic-Control implementation.
☆17Jun 3, 2019Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
veronicachelu / temporal_abstraction
View on GitHub
Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…
☆24Nov 29, 2018Updated 7 years ago
ahmed-touati / controllable_agent
View on GitHub
☆61Jun 6, 2023Updated 3 years ago
RonanFR / UCRL
View on GitHub
☆27May 17, 2019Updated 7 years ago
szemenyeim / DynEnv
View on GitHub
Dynamic Simulation Environments for Reinforcement Learning
☆13Apr 17, 2021Updated 5 years ago
boschresearch / DD_OPG
View on GitHub
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Jun 12, 2019Updated 7 years ago
RobertTLange / gym-hanoi
View on GitHub
A Towers of Hanoi environment in OpenAI Gym Style
☆14Jun 6, 2019Updated 7 years ago
BorealisAI / pommerman-baseline
View on GitHub
Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"
☆38May 9, 2019Updated 7 years ago
shamanez / VUSFA-Variational-Universal-Successor-Features-Approximator
View on GitHub
This repository contains implementations of the paper VUSFA
☆14Mar 31, 2021Updated 5 years ago
deep-skill-chaining / deep-skill-chaining
View on GitHub
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30Sep 24, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
voot-t / guide-actor-critic
View on GitHub
Keras implementation of guide actor-critic for continuous control
☆11Mar 12, 2018Updated 8 years ago
facebookresearch / impact-driven-exploration
View on GitHub
impact-driven-exploration
☆136Oct 3, 2023Updated 2 years ago
DuaneNielsen / rnd
View on GitHub
Exploration by Random Network Distillation
☆15Dec 30, 2018Updated 7 years ago
widmi / rudder-a-practical-tutorial
View on GitHub
A practical step-by-step guide to applying RUDDER
☆36Nov 12, 2019Updated 6 years ago
robfiras / s2pg
View on GitHub
Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"
☆25May 5, 2024Updated 2 years ago
tesatory / hsp
View on GitHub
Hierarchical Self-Play
☆21Dec 5, 2018Updated 7 years ago
ming93 / Safe_reinforcement_learning
View on GitHub
Convergent Policy Optimization for Safe Reinforcement Learning
☆11Oct 26, 2019Updated 6 years ago