abenechehab / diclLinks
[ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcement Learning using Large Language Models".
☆26Updated 7 months ago
Alternatives and similar repositories for dicl
Users that are interested in dicl are comparing it to the libraries listed below
Sorting:
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆30Updated 4 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆59Updated 7 months ago
- The original Shared Recurrent Memory Transformer implementation☆31Updated 2 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆32Updated 3 months ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆18Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆110Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆27Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆33Updated 2 months ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆111Updated last month
- ☆23Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025☆78Updated 7 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆111Updated 3 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆35Updated 10 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆64Updated 7 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated 10 months ago
- ☆70Updated last year
- Repo to reproduce the First-Explore paper results☆38Updated 9 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆74Updated 3 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆134Updated 4 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆72Updated last year
- Dateset Reset Policy Optimization☆30Updated last year
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆16Updated 6 months ago
- The official implementation of "Horizon Reduction Makes RL Scalable"☆139Updated last month
- this is for fun, ain't it grand!☆20Updated last week
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆262Updated 6 months ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆18Updated last year
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆68Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆108Updated last year