betray12138 / UNICORN
The Codebase of <Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning> In NeurIPS 2024
☆22Updated 2 months ago
Alternatives and similar repositories for UNICORN
Users that are interested in UNICORN are comparing it to the libraries listed below
Sorting:
- ☆30Updated 2 years ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆37Updated last year
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆37Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆25Updated 2 weeks ago
- Online Preference Alignment for Language Models via Count-based Exploration☆14Updated 4 months ago
- ☆24Updated last year
- ☆13Updated last month
- ☆61Updated 6 months ago
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆15Updated 4 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 5 months ago
- ☆28Updated last week
- ☆14Updated 2 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Updated last year
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated last month
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆51Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆47Updated last year
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆28Updated 2 weeks ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆9Updated 3 months ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 7 months ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆28Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- ☆44Updated last month
- Official repository of Action-Free Guide☆11Updated 2 years ago
- ☆17Updated 8 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- Graph Diffusion Policy Optimization☆35Updated last year
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆40Updated last year
- Minimal RLHF implementation built on top of minGPT.☆28Updated 10 months ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆18Updated 4 years ago