pickxiguapi / Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
☆30Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for Uni-RLHF-Platform
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆31Updated 7 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆32Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆51Updated last month
- ☆20Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆25Updated 6 months ago
- Official code repository for Prompt-DT.☆96Updated 2 years ago
- [NeurIPS 2024] GenRL: Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning the…☆58Updated 3 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆37Updated 6 months ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆43Updated 7 months ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆25Updated last year
- An RL-Friendly Vision-Language Model for Minecraft☆25Updated 3 weeks ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆16Updated 5 months ago
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆23Updated 8 months ago
- ☆45Updated 9 months ago
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆67Updated last month
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆39Updated last year
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"☆127Updated 2 weeks ago
- ☆22Updated 10 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆89Updated this week
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 3 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆36Updated 8 months ago
- ☆17Updated 6 months ago
- ☆51Updated 8 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆150Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆49Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated 11 months ago
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).☆71Updated 7 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆85Updated 11 months ago
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"☆42Updated last year
- ☆60Updated 5 months ago