PKU-RL / AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆10Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for AdaRefiner
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆151Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆31Updated 7 months ago
- ☆67Updated last year
- Official code repository for Prompt-DT.☆98Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆65Updated 6 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated last month
- Extreme Q-Learning: Max Entropy RL without Entropy☆80Updated last year
- An RL-Friendly Vision-Language Model for Minecraft☆26Updated last month
- Implementation of Multi-Game Decision Transformers in PyTorch☆43Updated last year
- ☆15Updated last month
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated this week
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆72Updated 7 months ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL☆92Updated this week
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆32Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last year
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆23Updated 8 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 4 months ago
- Implementation of TWOSOME☆49Updated 6 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆59Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆26Updated 5 months ago
- ☆45Updated 2 years ago
- ☆31Updated last year
- Implementation of the paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆13Updated last month
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆31Updated last year
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆41Updated 9 months ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆25Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆71Updated 2 years ago
- This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate…☆15Updated 5 months ago