abenechehab / diclLinks

[ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcement Learning using Large Language Models".

☆22

Alternatives and similar repositories for dicl

Users that are interested in dicl are comparing it to the libraries listed below

Sorting:

ComputationalRobotics / TRAC
This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …
☆28Updated 2 months ago
rail-berkeley / SUPE
This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."
☆28Updated this week
vivekmyers / contrastive_metrics
Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"
☆27Updated last year
yilundu / ired_code_release
☆67Updated last year
conglu1997 / intelligent-go-explore
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
☆60Updated 4 months ago
CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆103Updated this week
jinpz / q_sharp
The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training
☆15Updated 4 months ago
vladisai / PLDM
☆36Updated this week
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆31Updated last year
sai-prasanna / dreaming_of_many_worlds
☆23Updated 9 months ago
lucidrains / SAC-pytorch
Implementation of Soft Actor Critic and some of its improvements in Pytorch
☆60Updated 5 months ago
DHDev0 / Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆27Updated 3 weeks ago
Wenyueh / game_theory
How to create rational LLM-based agents? Using game-theoretic workflows!
☆72Updated last month
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆21Updated last year
NVlabs / gbrl_sb3
GBRL-based Actor-Critic algorithms implemented in stable-baselines3
☆35Updated 2 weeks ago
jennyzzt / omni
OMNI: Open-endedness via Models of human Notions of Interestingness
☆50Updated 5 months ago
Improbable-AI / orso
☆13Updated 4 months ago
dunnolab / xland-minigrid-datasets
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025
☆75Updated 5 months ago
lucidrains / improving-transformers-world-model-for-rl
Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch
☆128Updated 2 months ago
ml-jku / LRAM
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
☆33Updated 8 months ago
vmicheli / delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024
☆105Updated 9 months ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆105Updated 3 weeks ago
Aloriosa / srmt
The original Shared Recurrent Memory Transformer implementation
☆27Updated last week
pickxiguapi / Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…
☆36Updated 7 months ago
facebookresearch / oni
Learn online intrinsic rewards from LLM feedback
☆41Updated 7 months ago
heatz123 / tldr
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
☆32Updated 9 months ago
gauthamvasan / avg
Action Value Gradient Algorithm
☆21Updated last month
Shalev-Lifshitz / MultiAgentVerification
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
☆19Updated 4 months ago
shenao-zhang / BARL
Bayes-Adaptive RL for LLM Reasoning
☆37Updated last month
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆39Updated last month