betray12138 / UNICORN
The Codebase of <Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning> In NeurIPS 2024
☆18Updated last month
Alternatives and similar repositories for UNICORN:
Users that are interested in UNICORN are comparing it to the libraries listed below
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆35Updated last year
- ☆30Updated 2 years ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆23Updated 5 months ago
- ☆24Updated last year
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆37Updated last year
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- ☆61Updated 4 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆42Updated last year
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Updated last year
- Online Preference Alignment for Language Models via Count-based Exploration☆13Updated 2 months ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆27Updated last year
- ☆16Updated 7 months ago
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆12Updated 2 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆33Updated 4 months ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆14Updated 7 months ago
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).☆74Updated last year
- ☆16Updated 2 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆25Updated 2 months ago
- [NeurIPS 2023] Efficient Diffusion Policy☆96Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆52Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- Meta-RL Model-Based Algorithm☆31Updated 10 months ago
- ☆31Updated 5 months ago
- Dateset Reset Policy Optimization☆30Updated 11 months ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆46Updated last year
- ☆11Updated 11 months ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆21Updated 4 months ago
- official implementation of QVPO☆27Updated 5 months ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)☆26Updated last year