CursedSeraphim / icmppoLinks

Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

☆16

Alternatives and similar repositories for icmppo

Users that are interested in icmppo are comparing it to the libraries listed below

Sorting:

twni2016 / Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆32Updated 4 years ago
mengf1 / CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
☆65Updated 5 years ago
toshikwa / slac.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).
☆93Updated last year
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41Updated 3 years ago
jatinarora2702 / gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
☆25Updated 4 years ago
Mee321 / policy-distillation
☆14Updated 5 years ago
jakegrigsby / super_sac
A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…
☆38Updated last year
HumanCompatibleAI / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆31Updated 4 years ago
navneet-nmk / Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
☆62Updated 6 years ago
pairlab / d2rl
Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"
☆40Updated 4 years ago
jesbu1 / hidio
Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options
☆46Updated 3 years ago
uoe-agents / derl
The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆27Updated 3 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Updated 3 weeks ago
RchalYang / Soft-Module
Code for "Multi-task Reinforcement Learning with Soft Modularization"
☆122Updated 4 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆29Updated 5 years ago
kristery / Imitation-Learning-from-Imperfect-Demonstration
[ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"
☆50Updated 6 years ago
martius-lab / HiTS
Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021
☆34Updated 3 years ago
zoharri / mamba
Meta-RL Model-Based Algorithm
☆40Updated 3 months ago
yashbonde / Transformer-RL
Experiments to train transformer network to master reinforcement learning environments.
☆32Updated 4 years ago
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆40Updated 2 years ago
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆70Updated 3 weeks ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
OpenRL-Lab / TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体
☆61Updated last year
trzhang0116 / HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot…
☆42Updated last year
LunjunZhang / world-model-as-a-graph
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
☆66Updated 4 years ago
montaserFath / BCO
behavior cloning from observation
☆36Updated 4 years ago
flowersteam / TeachMyAgent
TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.
☆76Updated last year
alirezakazemipour / DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
☆100Updated 2 months ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆27Updated 5 years ago
psclklnk / spdl
Source code for the Self-Paced Deep Reinforcement Learning Experiments
☆32Updated 2 years ago