PKU-RL / AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆13 Updated 7 months ago
Alternatives and similar repositories for AdaRefiner:
Users who are interested in AdaRefiner are comparing it to the repositories listed below.
- ☆74 Updated last year
- ☆22 Updated 5 months ago
- Official code repository for Prompt-DT. ☆107 Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆159 Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆34 Updated last month
- Author's PyTorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO). ☆82 Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆54 Updated 5 months ago
- Implementation of Multi-Game Decision Transformers in PyTorch ☆46 Updated 2 years ago
- Implementation of TWOSOME ☆66 Updated 2 months ago
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning ☆25 Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games. ☆29 Updated 3 years ago
- ☆12 Updated last year
- ☆27 Updated last year
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model ☆32 Updated 11 months ago
- CORRO code ☆35 Updated 2 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL) ☆18 Updated 2 years ago
- Synthetic Experience Replay ☆88 Updated 9 months ago
- ☆59 Updated 4 months ago
- ☆33 Updated last year
- Overcooked human-AI experiment platform ☆37 Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆84 Updated 2 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆73 Updated 2 years ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human … ☆35 Updated 11 months ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆32 Updated 3 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆41 Updated 11 months ago
- Datasets with baselines for offline multi-agent reinforcement learning. ☆161 Updated last week
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024). ☆42 Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆77 Updated 3 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆51 Updated last year