Ryann-Ran / SconeLinks
Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject composition and subject distinction tasks in complex contexts.
☆28Updated 3 weeks ago
Alternatives and similar repositories for Scone
Users that are interested in Scone are comparing it to the libraries listed below
Sorting:
- The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation☆23Updated 5 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆140Updated this week
- [NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.☆151Updated 3 months ago
- UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation☆121Updated last month
- [ICCV 2025] Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation☆25Updated 5 months ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing☆23Updated 5 months ago
- ☆27Updated 9 months ago
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆73Updated 3 months ago
- ☆29Updated 10 months ago
- (ICCV 2025)This repository is the official implementation of AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detect…☆159Updated 6 months ago
- [NeurIPS2024]☆36Updated last year
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…☆104Updated 3 weeks ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆128Updated last year
- EARL: Editing with Autoregression and RL☆42Updated 2 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆182Updated 3 months ago
- [ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?☆48Updated last week
- [ICCV 2025 Highlight] LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs☆19Updated 2 months ago
- Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model.☆77Updated 7 months ago
- [CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution☆216Updated last month
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆108Updated 4 months ago
- Unified Multi-modal IAA Baseline and Benchmark☆92Updated last year
- EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling☆209Updated this week
- An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance”☆39Updated 2 years ago
- Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation☆42Updated 6 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆42Updated 10 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆53Updated 4 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆104Updated 9 months ago
- [CVPR2025] Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters☆43Updated 10 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆186Updated 8 months ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆15Updated last year