betray12138 / UNICORN
The codebase of "Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning" (NeurIPS 2024)
☆22Updated 4 months ago
Alternatives and similar repositories for UNICORN
Users who are interested in UNICORN are comparing it to the repositories listed below
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆38Updated last year
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Updated last year
- Online Preference Alignment for Language Models via Count-based Exploration☆14Updated 6 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆28Updated 2 months ago
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆37Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 7 months ago
- ☆30Updated 2 years ago
- Meta-RL Model-Based Algorithm☆38Updated 2 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.☆58Updated last year
- A collection of research works on Automatic Reinforcement Learning from Microsoft Research Asia.☆55Updated this week
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆45Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆19Updated last month
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".☆25Updated last year
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆19Updated 3 years ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆12Updated 9 months ago
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆41Updated last year
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆32Updated 2 months ago
- ☆62Updated 8 months ago
- ☆45Updated 11 months ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆17Updated 9 months ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆18Updated 4 months ago
- Minimal RLHF implementation built on top of minGPT.☆30Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- ☆47Updated 3 months ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆31Updated 6 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆52Updated last year
- This is the PyTorch implementation of the UAI 2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- ☆32Updated 8 months ago