MinghanLi / FiVE-Bench
[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
☆33 Updated last month
Alternatives and similar repositories for FiVE-Bench
Users who are interested in FiVE-Bench are comparing it to the repositories listed below.
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024) ☆38 Updated last year
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation ☆31 Updated last year
- Training Autoregressive Image Generation models via Reinforcement Learning ☆50 Updated 2 months ago
- An innovative method designed to augment the capabilities of existing video diffusion models ☆22 Updated last year
- FreeCond: A Free Lunch for Input Conditions in Text-Guided Inpainting. FreeCond introduces a more generalized form💪 of the original inpa… ☆15 Updated 8 months ago
- MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025] ☆74 Updated 3 weeks ago
- EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026] ☆118 Updated last week
- UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture ☆87 Updated last week
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆72 Updated 7 months ago
- [ICLR 2025, AAAI 2026] official implementation of "Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generati… ☆34 Updated 2 weeks ago
- Official PyTorch implementation for the paper: "VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models" ☆22 Updated 6 months ago
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing" ☆88 Updated last year
- Official implementation of "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR 2024) ☆59 Updated last year
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection ☆55 Updated 5 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing" ☆28 Updated 9 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions ☆45 Updated 8 months ago
- Perceptual Artifacts Localization for Image Synthesis Tasks (ICCV '23) ☆66 Updated 2 years ago
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content" ☆39 Updated 8 months ago
- ☆35 Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 Updated last year
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23. ☆78 Updated 2 years ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆128 Updated last year
- [ECCV2024] Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models ☆47 Updated last year
- ICCV2023-Diffusion-Papers ☆108 Updated 2 years ago
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models". ☆31 Updated last year
- Video Diffusion Transformers are In-Context Learners ☆36 Updated last year
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis ☆46 Updated last year
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation ☆110 Updated 9 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing ☆25 Updated last year
- Official repository for “PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss” ☆175 Updated last week