PKU-Alignment / safe-sora
SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).
☆24 Updated 3 weeks ago
Related projects:
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24). ☆26 Updated 4 months ago
- This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Co… ☆66 Updated 2 months ago
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment ☆16 Updated 6 months ago
- [ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning" ☆62 Updated 2 weeks ago
- The official implementation of "Instruction-Guided Visual Masking" ☆21 Updated last month
- Official repo for "iVideoGPT: Interactive VideoGPTs are Scalable World Models", https://arxiv.org/abs/2405.15223 ☆60 Updated 2 weeks ago
- [CVPR2024] This is the official implementation of MP5 ☆72 Updated 2 months ago
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar… ☆53 Updated 7 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆30 Updated 5 months ago
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight) ☆37 Updated 10 months ago
- Paper collections of the continuous effort start from World Models. ☆127 Updated 2 months ago
- Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23) ☆26 Updated last year
- [ACL'2024] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆44 Updated 3 weeks ago
- PyTorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024). ☆49 Updated 3 weeks ago
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning" ☆113 Updated 8 months ago
- Align Anything: Training All-modality Model with Feedback ☆100 Updated this week
- HAZARD challenge ☆25 Updated 4 months ago
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation` ☆43 Updated 3 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity ☆35 Updated 8 months ago
- ⛏💎 STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment ☆27 Updated 8 months ago
- Source code for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation" ☆25 Updated 5 months ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023) ☆37 Updated last year
- Official code for ICLR 2024 paper Do Generated Data Always Help Contrastive Learning? ☆25 Updated 5 months ago
- Codebase for HiP ☆84 Updated 9 months ago