pickxiguapi / Clean-Offline-RLHFLinks
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
☆38Updated last year
Alternatives and similar repositories for Clean-Offline-RLHF
Users that are interested in Clean-Offline-RLHF are comparing it to the libraries listed below
Sorting:
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 7 months ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆31Updated 2 years ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆27Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆28Updated 2 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆59Updated 9 months ago
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay