☆54May 6, 2025Updated 10 months ago
Alternatives and similar repositories for IPO
Users that are interested in IPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"☆38May 21, 2025Updated 10 months ago
- ☆18Oct 23, 2024Updated last year
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models☆52Feb 21, 2026Updated last month
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆86May 4, 2025Updated 10 months ago
- ☆14Jul 17, 2024Updated last year
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆40Oct 30, 2025Updated 4 months ago
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆391Mar 26, 2025Updated 11 months ago
- Official Implementation of VideoDPO☆163Jun 1, 2025Updated 9 months ago
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi…☆240Mar 19, 2025Updated last year
- ☆20Jan 1, 2026Updated 2 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆101Oct 3, 2025Updated 5 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆439Sep 24, 2025Updated 5 months ago
- ☆30Mar 4, 2025Updated last year
- Exploring Representation-Aligned Latent Space for Better Generation☆18Updated this week
- ☆68Aug 16, 2024Updated last year
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆300Apr 23, 2025Updated 11 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆52Updated this week
- Benchmark dataset and code of MSRVTT-Personalization☆51Nov 10, 2025Updated 4 months ago
- ☆19May 10, 2025Updated 10 months ago
- ☆40Dec 16, 2025Updated 3 months ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated 11 months ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆43Jan 9, 2026Updated 2 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆202Jan 7, 2026Updated 2 months ago
- ☆132Jun 24, 2025Updated 8 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆214Mar 11, 2026Updated last week
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆737Nov 27, 2025Updated 3 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆114Dec 4, 2025Updated 3 months ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆47Jul 1, 2025Updated 8 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- ☆46Mar 12, 2026Updated last week
- Unofficial implementation of Face0 with SDXL☆12Sep 1, 2023Updated 2 years ago
- ☆13Apr 7, 2022Updated 3 years ago
- Code and model for OFNet paper☆18Oct 17, 2019Updated 6 years ago
- ☆98Mar 3, 2025Updated last year
- USTC矩阵分析与应用课后题目总结与解答☆13Jul 7, 2023Updated 2 years ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆86Sep 18, 2025Updated 6 months ago
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆170Nov 5, 2024Updated last year
- ☆35Nov 28, 2024Updated last year
- ☆30May 9, 2024Updated last year