pickxiguapi / Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
☆31Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for Clean-Offline-RLHF
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated 7 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆51Updated last month
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆25Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆25Updated 6 months ago
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆23Updated 8 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆32Updated last year
- ☆20Updated last year
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated last year
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 3 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆50Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆37Updated 6 months ago
- ☆15Updated last year
- ☆45Updated 9 months ago
- [NeurIPS 2024] GenRL: Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning the…☆58Updated 3 months ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆30Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆17Updated 6 months ago
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆25Updated 5 months ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"☆50Updated 9 months ago
- Deep reinforcement learning-basedskill transfer and composition method☆9Updated 4 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆25Updated last year
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆23Updated last year
- ☆51Updated 8 months ago
- PWM: Policy Learning with Large World Models☆37Updated 2 months ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆43Updated 7 months ago
- ☆34Updated 3 months ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆14Updated last year
- ☆41Updated 4 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆57Updated last year