betray12138 / UNICORN
The Codebase of <Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning> In NeurIPS 2024
☆12Updated 4 months ago
Alternatives and similar repositories for UNICORN:
Users that are interested in UNICORN are comparing it to the libraries listed below
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆33Updated 10 months ago
- the training and inference code and data for LLMOPT☆17Updated 3 months ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Updated last year
- Implementation of the paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆13Updated 3 months ago
- ☆28Updated last year
- Meta-RL Model-Based Algorithm☆29Updated 8 months ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆25Updated 10 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆33Updated 2 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated 3 months ago
- ☆19Updated 3 months ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 2 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆19Updated this week
- ☆14Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated last year
- (ICML 2024) The official code for Value-Evolutionary-Based Reinforcement Learning☆13Updated 6 months ago
- ☆31Updated 3 months ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆34Updated 10 months ago
- ☆23Updated 10 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆51Updated last year
- Minimal RLHF implementation built on top of minGPT.☆24Updated 6 months ago
- ☆9Updated last year
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆13Updated 7 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated last year
- Implicit Distributional Actor Critic☆10Updated 3 years ago
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.☆27Updated last year
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆12Updated 10 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆39Updated 9 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆24Updated last month