cog-isa / forger

ForgER algorithm

☆22

Related projects: ⓘ

RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆19Updated 5 months ago
martius-lab / pink-noise-rl
☆37Updated last year
YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆44Updated 3 years ago
tseyde / decqn
☆33Updated last year
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆27Updated 2 years ago
proceduralia / high_replay_ratio_continuous_control
Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"
☆22Updated last year
chscheller / minerl_agent
3rd placed submission to the NeurIPS MineRL competition 2019
☆10Updated last year
samlobel / CFN
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
☆16Updated 8 months ago
RajGhugare19 / alm
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
☆76Updated last year
ikostrikov / dmcgym
☆23Updated 2 years ago
denisyarats / exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
☆100Updated 2 years ago
QData / dmc_remastered
A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.
☆16Updated 3 years ago
rmrafailov / LOMPO
Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models
☆28Updated 3 years ago
pairlab / vagram
[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆24Updated last year
ini / multigrid
Fast and flexible multi-agent gridworld reinforcement learning environments.
☆27Updated last month
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆24Updated last year
jsikyoon / V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
☆12Updated 3 years ago
RyanNavillus / reward-surfaces
☆15Updated 4 months ago
facebookresearch / hsd3
Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines
☆49Updated 2 years ago
uoe-agents / derl
The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆24Updated 2 years ago
ahmed-touati / controllable_agent
☆34Updated last year
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆71Updated 9 months ago
yifan12wu / rl-laplacian
Learning Laplacian Representations in Reinforcement Learning
☆17Updated 3 years ago
51616 / marl-lipo
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆14Updated 4 months ago
UtkarshMishra04 / pixel-representations-RL
This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…
☆12Updated last year
ikostrikov / jaxrl2
☆41Updated last year
young-geng / JaxCQL
Conservative Q learning in Jax
☆49Updated last year
ndrwmlnk / critic-guided-segmentation-of-rewarding-objects-in-first-person-views
Critic Guided Segmentation of Rewarding Objects in First-Person Views. Explanatory video:
☆12Updated 2 years ago
architsharma97 / earl_benchmark
EARL: Environment for Autonomous Reinforcement Learning
☆33Updated last year
seohongpark / LSD
Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)
☆32Updated last year