JaydenLyh / SmPOLinks
Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences (ICML 2025)
☆25Updated 3 months ago
Alternatives and similar repositories for SmPO
Users that are interested in SmPO are comparing it to the libraries listed below
Sorting:
- InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment (CVPR 2025 Highlight)☆38Updated 3 months ago
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆42Updated 3 months ago
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆23Updated 3 months ago
- Official code for DeepSound-V1☆12Updated 4 months ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆121Updated last week
- [CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization☆43Updated 2 months ago
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls☆29Updated last month
- Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆57Updated 3 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆99Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆307Updated 2 weeks ago
- [ICCV25] USP: Unified Self-Supervised Pretraining for Image Generation and Understanding☆89Updated 3 months ago
- ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆18Updated 3 months ago
- Official implementation of MC-LLaVA.☆140Updated last month
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆152Updated 2 weeks ago
- [CVPR 2025] DreamRelation: Bridging Customization and Relation Generation☆17Updated last month
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆218Updated last month
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation☆62Updated last month
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆171Updated 4 months ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆32Updated last month
- Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unifie…☆273Updated last week
- An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆87Updated 2 weeks ago
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…☆71Updated last week
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆24Updated 4 months ago
- ☆156Updated 3 months ago
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding☆31Updated 6 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.☆53Updated this week
- Official code repo for our work "Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models"☆46Updated 3 months ago
- A collection of vision foundation models unifying understanding and generation.☆55Updated 9 months ago
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆142Updated 6 months ago
- UniGenBench: A Unified T2I Generation Benchmark☆48Updated this week