ddw2AIGROUP2CQUPT / Large-Scale-Multimodal-Face-Datasets
Millions-Level Face/Human-Scene Image-Text Datasets
☆23 Updated 5 months ago
Alternatives and similar repositories for Large-Scale-Multimodal-Face-Datasets
Users who are interested in Large-Scale-Multimodal-Face-Datasets compare it to the repositories listed below
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆98 Updated 2 weeks ago
- Replication in Visual Diffusion Models: A Survey and Outlook ☆31 Updated last year
- a collection of awesome autoregressive visual generation models ☆78 Updated 6 months ago
- Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing" ☆84 Updated this week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 Updated last year
- Unified Multi-modal IAA Baseline and Benchmark ☆90 Updated last year
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】 ☆80 Updated 10 months ago
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro… ☆80 Updated 2 weeks ago
- Decoupled Textual Embeddings for Customized Image Generation (AAAI 2024) ☆30 Updated last year
- Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation ☆35 Updated 3 months ago
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing ☆17 Updated 7 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation" ☆58 Updated 7 months ago
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning ☆101 Updated 5 months ago
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback ☆58 Updated last year
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning ☆182 Updated 2 weeks ago
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea" ☆197 Updated 7 months ago
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025) ☆209 Updated 2 months ago
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark ☆224 Updated this week
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆146 Updated 8 months ago
- Official code for K-LoRA (CVPR 2025) ☆129 Updated last month
- ICCV2023-Diffusion-Papers ☆108 Updated 2 years ago
- [ICCV 2025][Few-Step Student Surpasses Teacher Diffusion] Learning Few-Step Diffusion Models by Trajectory Distribution Matching ☆57 Updated last week
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆84 Updated 6 months ago
- Official implementation of "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024) ☆55 Updated 11 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍 ☆46 Updated 4 months ago
- An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance” ☆39 Updated 2 years ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆159 Updated last month