Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject composition and subject distinction tasks in complex contexts.
☆28Jan 14, 2026Updated last month
Alternatives and similar repositories for Scone
Users that are interested in Scone are comparing it to the libraries listed below
Sorting:
- ☆27Jan 28, 2026Updated last month
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆73Feb 13, 2026Updated 2 weeks ago
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication☆13Mar 11, 2025Updated 11 months ago
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 2 years ago
- SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram…☆10Sep 22, 2024Updated last year
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- Dataset Quantization with Active Learning based Adaptive Sampling [ECCV 2024]☆10Jul 9, 2024Updated last year
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆22Jul 13, 2025Updated 7 months ago
- [ECCV 2024] Characterizing Robustness via Natural Input Gradients☆13Oct 14, 2024Updated last year
- Simple Tensorflow implementation of "MirrorGAN: Learning Text-to-image Generation by Redescription" (CVPR 2019)☆15Mar 23, 2020Updated 5 years ago
- CVPR 2025 Accepted Papers☆23Dec 20, 2025Updated 2 months ago
- 电子科大格院毕设LaTeX模板☆19Jan 17, 2025Updated last year
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆12Oct 4, 2024Updated last year
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆26Feb 14, 2026Updated last week
- base: https://github.com/Sense-GVT/Fast-BEV , delete time sequence,update mm releated ,add onnx export for tensorrt☆12May 12, 2023Updated 2 years ago
- [ICLR 2025] Official repository for the paper "Influence-Guided Diffusion for Dataset Distillation".☆15Feb 12, 2025Updated last year
- A replication of Google's VideoPoet model☆11Feb 18, 2024Updated 2 years ago
- Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"☆134Updated this week
- pytorch implementation of XMC-GAN☆11Jun 2, 2021Updated 4 years ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Apr 6, 2025Updated 10 months ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023)☆15Nov 18, 2023Updated 2 years ago
- Code for the paper Semantic-Guided Inpainting Network for Complex UrbanScenes Manipulation☆13Jul 7, 2021Updated 4 years ago
- ☆21Jun 3, 2023Updated 2 years ago
- ☆18Mar 21, 2025Updated 11 months ago
- Self-Supervised Dataset Distillation for Transfer Learning☆16Apr 10, 2024Updated last year
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?☆13Aug 16, 2023Updated 2 years ago
- Alpha version of our data-centric visual benchmark for training data selection☆16Aug 28, 2023Updated 2 years ago
- [ICCV 2025] DiffDoctor: Diagnosing Image Diffusion Models Before Treating☆39Sep 9, 2025Updated 5 months ago
- Official Pytorch implementation for our ACM MM 2023 paper: Moiré Backdoor Attack (MBA): A Novel Trigger for Pedestrian Detectors in the P…☆16Jan 22, 2024Updated 2 years ago
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20May 22, 2025Updated 9 months ago
- Less is More: High-value Data Selection for Visual Instruction Tuning☆17Jan 18, 2025Updated last year
- ☆48Feb 9, 2026Updated 2 weeks ago
- 中科大跨模态智能组-每周论文分享☆16Nov 20, 2022Updated 3 years ago
- ☆19Apr 16, 2025Updated 10 months ago
- ☆20Feb 24, 2025Updated last year
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Jul 9, 2020Updated 5 years ago
- Poster at ITSC 2024☆19Nov 12, 2024Updated last year
- UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation☆35Nov 24, 2025Updated 3 months ago