rail-berkeley / SUPELinks

This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."

☆31

Alternatives and similar repositories for SUPE

Users that are interested in SUPE are comparing it to the libraries listed below

Sorting:

seohongpark / horizon-reduction
The official implementation of "Horizon Reduction Makes RL Scalable"
☆123Updated this week
heatz123 / tldr
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
☆33Updated 9 months ago
kvfrans / fre
Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"
☆57Updated last year
jianlanluo / SAQ
☆33Updated last month
chongyi-zheng / infom
Implementations of Intention-conditioned Flow Occupancy Models (InFOM)
☆22Updated 3 weeks ago
jlin816 / homegrid
A minimal home grid world environment to evaluate language understanding in interactive agents.
☆22Updated last year
mazpie / genrl
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…
☆78Updated 4 months ago
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆90Updated last year
cheryyunl / Make-An-Agent
☆45Updated last year
dibyaghosh / icvf_release
Public code for "Reinforcement Learning from Passive Data via Latent Intentions"
☆89Updated last year
enjeeneer / zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆17Updated 6 months ago
chandar-lab / Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
☆70Updated last year
vmicheli / delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024
☆105Updated 10 months ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆71Updated last year
facebookresearch / gen_dgrl
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆28Updated 11 months ago
ml-jku / L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
☆59Updated 10 months ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆106Updated last month
UT-Austin-RPL / amago
off-policy RL on long sequences
☆133Updated this week
facebookresearch / agenthive
AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.
☆34Updated last year
facebookresearch / mtm
MTM Masked Trajectory Models for Prediction, Representation, and Control.
☆157Updated 2 years ago
google-deepmind / dmc_vision_benchmark
☆27Updated last year
dojeon-ai / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆58Updated 2 months ago
FLAIROx / jafar
JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"
☆71Updated 6 months ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆17Updated 2 years ago
sukhijab / maxinforl_torch
☆44Updated 7 months ago
vivekmyers / contrastive_metrics
Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"
☆27Updated last year
imgeorgiev / PWM
PWM: Policy Learning with Large World Models
☆55Updated this week
SonyResearch / simba
☆102Updated 5 months ago
marc-rigter / polygrad-world-models
Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024
☆66Updated last year
Alescontrela / viper_rl
Using advances in generative modeling to learn reward functions from unlabeled videos.
☆134Updated last year