pickxiguapi / Uni-RLHF-PlatformLinks
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
☆36Updated 7 months ago
Alternatives and similar repositories for Uni-RLHF-Platform
Users that are interested in Uni-RLHF-Platform are comparing it to the libraries listed below
Sorting:
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆38Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆66Updated 9 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆59Updated 9 months ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆78Updated 3 months ago
- ☆59Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆52Updated last year
- Codebase for HiP☆90Updated last year
- Official code repository for Prompt-DT.☆113Updated 2 years ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆44Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated last year
- Implementations of Intention-conditioned Flow Occupancy Models (InFOM)☆18Updated 3 weeks ago
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆82Updated last month
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆38Updated 4 months ago
- ☆89Updated 2 years ago
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆170Updated 7 months ago
- ☆45Updated last year
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆12Updated 9 months ago
- This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".☆42Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 11 months ago
- An RL-Friendly Vision-Language Model for Minecraft☆33Updated 9 months ago
- ☆79Updated last year
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆132Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated last year
- ☆28Updated last year
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆156Updated 2 years ago
- ☆45Updated 11 months ago
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆67Updated 5 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆70Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated last year
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆30Updated last year