☆54 · May 6, 2025 · Updated 9 months ago
Alternatives and similar repositories for IPO
Users who are interested in IPO are comparing it to the repositories listed below.
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption" ☆38 · May 21, 2025 · Updated 9 months ago
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models ☆50 · Feb 21, 2026 · Updated last week
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models ☆40 · Oct 30, 2025 · Updated 4 months ago
- ☆18 · Oct 23, 2024 · Updated last year
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach ☆14 · Apr 2, 2025 · Updated 11 months ago
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation ☆380 · Mar 26, 2025 · Updated 11 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆85 · May 4, 2025 · Updated 9 months ago
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi… ☆236 · Mar 19, 2025 · Updated 11 months ago
- ☆14 · Jul 17, 2024 · Updated last year
- Official Implementation of VideoDPO ☆160 · Jun 1, 2025 · Updated 9 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback ☆428 · Sep 24, 2025 · Updated 5 months ago
- Unofficial implementation of Face0 with SDXL ☆12 · Sep 1, 2023 · Updated 2 years ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing ☆38 · Jan 9, 2026 · Updated last month
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation ☆45 · Jul 1, 2025 · Updated 8 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search ☆100 · Oct 3, 2025 · Updated 5 months ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction" ☆301 · Apr 23, 2025 · Updated 10 months ago
- Exploring Representation-Aligned Latent Space for Better Generation ☆17 · Feb 4, 2025 · Updated last year
- Benchmark dataset and code of MSRVTT-Personalization ☆52 · Nov 10, 2025 · Updated 3 months ago
- ☆20 · Jan 1, 2026 · Updated 2 months ago
- ☆22 · Mar 7, 2025 · Updated 11 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization ☆25 · Apr 14, 2025 · Updated 10 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video ☆48 · Sep 8, 2025 · Updated 5 months ago
- ☆40 · Dec 16, 2025 · Updated 2 months ago
- ☆132 · Jun 24, 2025 · Updated 8 months ago
- ☆28 · Mar 4, 2025 · Updated 11 months ago
- Generated Faces in the Wild Dataset and Code ☆18 · Mar 2, 2025 · Updated last year
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆198 · Jan 7, 2026 · Updated last month
- Code and model for OFNet paper ☆18 · Oct 17, 2019 · Updated 6 years ago
- Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944 ☆336 · Aug 7, 2025 · Updated 6 months ago
- [ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation ☆106 · Feb 6, 2026 · Updated 3 weeks ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆66 · May 7, 2025 · Updated 9 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25] ☆34 · Sep 1, 2025 · Updated 6 months ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image ☆22 · Sep 15, 2025 · Updated 5 months ago
- [NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation ☆721 · Nov 27, 2025 · Updated 3 months ago
- All-round Creator and Editor ☆240 · Oct 16, 2025 · Updated 4 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆112 · Dec 4, 2025 · Updated 2 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation ☆39 · Aug 3, 2025 · Updated 7 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision ☆210 · Jan 27, 2026 · Updated last month
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation ☆53 · Jan 5, 2026 · Updated last month