Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".
☆115Mar 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for MICo-150K
Users that are interested in MICo-150K are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fine-tune of Florence-2 for shot categorization.☆26Mar 6, 2025Updated last year
- official code for unigame☆19Nov 26, 2025Updated 4 months ago
- AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning☆96Nov 21, 2025Updated 4 months ago
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark☆294Nov 5, 2025Updated 4 months ago
- CVPR2026 Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers☆71Mar 12, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆20Apr 15, 2025Updated 11 months ago
- [CVPR2026] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding☆96Mar 17, 2026Updated last week
- FDFO: Finite Difference Flow Optimization☆66Mar 16, 2026Updated 2 weeks ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆28Mar 18, 2026Updated last week
- Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."☆84Feb 25, 2026Updated last month
- ☆14Sep 11, 2025Updated 6 months ago
- ☆13Jul 3, 2024Updated last year
- Official implementation of "InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention" (NeurIPS 2025)☆41Oct 17, 2025Updated 5 months ago
- ☆28Feb 2, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆35Sep 9, 2024Updated last year
- ☆13May 15, 2025Updated 10 months ago
- Instance-Level Salient Object Detection, Computer Vision and Image Understanding (CVIU), 2021.☆12Apr 23, 2021Updated 4 years ago
- The offical code of Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis☆68Feb 25, 2026Updated last month
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 9 months ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- Code of Decomposition and Completion Network for Salient Object Detection, TIP 2021.☆10Mar 30, 2023Updated 3 years ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆66Nov 1, 2024Updated last year
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- https://avocado-captioner.github.io/☆32Oct 16, 2025Updated 5 months ago
- LLaVA-Next for STVG☆18Dec 5, 2025Updated 3 months ago
- A PyTorch implementation of NormSoftmax based on BMVC 2019 paper "Classification is a Strong Baseline for Deep Metric Learning"☆10Mar 15, 2020Updated 6 years ago
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆29Mar 17, 2026Updated last week
- ☆22Nov 25, 2025Updated 4 months ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆18Jun 19, 2025Updated 9 months ago
- Layered Score Distillation for Disentangled Object Relighting☆23Jan 15, 2024Updated 2 years ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis☆86Feb 3, 2025Updated last year
- Source code of " LIVENet: A novel network for real-world low-light image denoising and enhancement", published in WACV 2024☆13Dec 20, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ImageNet-12k subset of ImageNet-21k (fall11)☆22Jun 13, 2023Updated 2 years ago
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding☆83Jul 4, 2025Updated 8 months ago
- ☆16Sep 18, 2023Updated 2 years ago
- ☆17Mar 10, 2025Updated last year
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆44Feb 10, 2026Updated last month
- An implementation of torchngp + semantic-nerf☆13Sep 10, 2023Updated 2 years ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from u…☆210May 5, 2025Updated 10 months ago