Official implementation of Add-SD: Rational Generation without Manual Reference.
☆28Aug 19, 2024Updated last year
Alternatives and similar repositories for Add-SD
Users that are interested in Add-SD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆38Sep 10, 2024Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆55Feb 1, 2024Updated 2 years ago
- CVPR 2025 Workshop on CVEU.☆42Jun 12, 2025Updated 10 months ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis☆86Feb 3, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- "Visual Prompt Selection for In-Context Learning Segmentation Framework"☆15Dec 13, 2024Updated last year
- Video Diffusion State Space Models☆19Mar 27, 2024Updated 2 years ago
- [ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"☆55Feb 10, 2025Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing☆30Mar 29, 2024Updated 2 years ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆52Oct 14, 2024Updated last year
- (CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Models☆52Sep 10, 2025Updated 7 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- An innovative method designed to augment the capabilities of existing video diffusion models☆22May 10, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repo holds the research projects of our lab.☆11Jan 20, 2024Updated 2 years ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Oct 13, 2024Updated last year
- ☆15Jul 13, 2023Updated 2 years ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 7 months ago
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Sep 11, 2024Updated last year
- ConceptsDreambooth☆19Nov 30, 2022Updated 3 years ago
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following☆32Jan 24, 2025Updated last year
- ☆42May 15, 2025Updated 11 months ago
- [TMLR] Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free U…☆74Nov 29, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR2024] Mask Grounding for Referring Image Segmentation☆28Jul 22, 2024Updated last year
- Inference code for DWCode☆35Oct 24, 2023Updated 2 years ago
- [CVPR'25] MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models☆65May 27, 2025Updated 11 months ago
- [ICCV 2025 Highlight] Panorama Generation as a Next-Token Prediction Task.☆48Oct 29, 2025Updated 6 months ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆114Sep 13, 2024Updated last year
- ☆18Apr 4, 2025Updated last year
- Video Diffusion Transformers are In-Context Learners☆36Jan 6, 2025Updated last year
- ☆19Oct 23, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.☆15Mar 12, 2024Updated 2 years ago
- ☆27Mar 3, 2025Updated last year
- ISE: Implicit Sample Extension for Unsupervised Person Re-Identification (CVPR2022)☆13Dec 23, 2022Updated 3 years ago
- DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection☆21Oct 5, 2023Updated 2 years ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆72Jun 3, 2024Updated last year
- Evaluating language models on word puzzle games☆10Oct 25, 2024Updated last year
- 3D-Aware Video Generation☆75Nov 15, 2022Updated 3 years ago