rlglab / optionzero

[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm

☆19

Alternatives and similar repositories for optionzero

Users who are interested in optionzero are comparing it to the libraries listed below.

Shengjiewang-Jason / EfficientZeroV2
[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
☆89 Updated 11 months ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆105 Updated 3 weeks ago
SonyResearch / simba
☆99 Updated 4 months ago
EmptyJackson / unifloral
Unified Implementations of Offline Reinforcement Learning Algorithms
☆85 Updated 2 months ago
k4ntz / OC_Atari
Object Centric Atari games
☆85 Updated last month
cooperativex / SocialJax
SocialJax: sequential social dilemma environments
☆41 Updated last month
timoklein / redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
☆28 Updated 8 months ago
dojeon-ai / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆57 Updated last month
conglu1997 / SynthER
Synthetic Experience Replay
☆94 Updated last year
MichalBortkiewicz / JaxGCRL
Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.
☆175 Updated 2 months ago
automl / CARL
Benchmarking RL generalization in an interpretable way.
☆157 Updated last month
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆145 Updated last year
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆173 Updated 4 months ago
jrobine / twm
Transformer-based World Models
☆84 Updated 2 years ago
instadeepai / og-marl
Datasets with baselines for offline multi-agent reinforcement learning.
☆174 Updated 2 months ago
jon--lee / decision-pretrained-transformer
Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…
☆69 Updated last year
x35f / unstable_baselines
Re-implementations of SOTA RL algorithms.
☆134 Updated last year
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆102 Updated 8 months ago
kristery / Elastic-DT
[NeurIPS 2023] Implementation of Elastic Decision Transformer
☆35 Updated last year
weipu-zhang / STORM
☆93 Updated last year
mklissa / dceo
Learning diverse options through the Laplacian representation.
☆23 Updated last year
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆150 Updated 6 months ago
mxu34 / prompt-dt
Official code repository for Prompt-DT.
☆113 Updated 2 years ago
OffDynamicsRL / off-dynamics-rl
☆49 Updated 7 months ago
typoverflow / WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆20 Updated 3 months ago
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆103 Updated last year
jhejna / inverse-preference-learning
☆41 Updated 2 years ago
UT-Austin-RPL / amago
A simple and scalable agent for training adaptive policies with sequence-based RL
☆131 Updated 2 weeks ago
Div99 / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆87 Updated 2 years ago
gwthomas / IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
☆83 Updated 3 years ago