jhoon-cho / MBTL
Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)
☆19Updated 2 months ago
Alternatives and similar repositories for MBTL:
Users who are interested in MBTL are comparing it to the repositories listed below.
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆20Updated 3 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆27Updated 7 months ago
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Self-contained PyTorch implementation of a Sinkhorn-based router, for mixture of experts or otherwise☆32Updated 5 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆26Updated 3 months ago
- Learn online intrinsic rewards from LLM feedback☆34Updated 2 months ago
- ☆38Updated last week
- Lottery Ticket Adaptation☆37Updated 3 months ago
- We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effe…☆21Updated last year
- ☆21Updated 10 months ago
- ☆15Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 5 months ago
- ☆21Updated 4 months ago
- ☆42Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated 8 months ago
- Code for "Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design".☆31Updated 3 weeks ago
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.☆27Updated last year
- ☆46Updated last week
- Exploration into the Firefly algorithm in Pytorch☆35Updated last week
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆13Updated 8 months ago
- TaskMet: Task-driven Metric Learning for Model Learning☆19Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated last month
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆63Updated 5 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated last month
- ☆44Updated 3 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated 11 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- ☆18Updated 4 months ago