jhoon-cho / MBTL
Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)
☆25Updated 8 months ago
Alternatives and similar repositories for MBTL
Users who are interested in MBTL are comparing it to the repositories listed below
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- Code for [ICML 2025] "Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design".☆54Updated 3 months ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆45Updated last year
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆58Updated last month
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆39Updated last year
- ☆53Updated 2 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆29Updated 4 months ago
- ☆23Updated 11 months ago
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆26Updated 6 months ago
- ☆16Updated 9 months ago
- Repo to reproduce the First-Explore paper results☆38Updated 8 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆20Updated 6 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆35Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆45Updated 2 years ago
- Implementations of Curious Replay for model-based adaptation.☆41Updated 2 years ago
- Bayes-Adaptive RL for LLM Reasoning☆37Updated 3 months ago
- TaskMet: Task-driven Metric Learning for Model Learning☆19Updated last year
- Multi-agent active perception with prediction rewards☆11Updated 4 years ago
- ☆11Updated last year
- Implementation of the HIERarchical imagination On Structured State Space Sequence Models (HIEROS) paper☆18Updated last year
- Online Preference Alignment for Language Models via Count-based Exploration☆16Updated 7 months ago
- Decision Transformer for offline single-agent autonomous highway driving☆27Updated 2 years ago
- Swarm learning algorithm☆11Updated 4 years ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆20Updated 10 months ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Updated last year
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.☆27Updated last year
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆35Updated last year
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Code repo for ICML'23 Searching Large Neighborhoods for Integer Linear Programs with Contrastive Learning☆43Updated 2 years ago