Visual-AI / SCD
The official repository for the CVPRW 2024 paper "What’s in a Name? Beyond Class Indices for Image Recognition"
☆12 Updated 7 months ago
Alternatives and similar repositories for SCD:
Users who are interested in SCD are comparing it to the repositories listed below:
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis" ☆19 Updated 2 months ago
- ☆14 Updated last year
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models ☆28 Updated 3 months ago
- ☆28 Updated last year
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks. ☆14 Updated 7 months ago
- Codebase for the paper "HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View" ☆13 Updated 10 months ago
- [Diffusion Test-Time Refinement] Official repo for "FreSca: Unveiling the Scaling Space in Diffusion Models" ☆25 Updated last week
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance ☆20 Updated 8 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 Updated 8 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024] ☆23 Updated 8 months ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024] ☆33 Updated last year
- Official PyTorch implementation for SingleInsert ☆26 Updated 11 months ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation ☆23 Updated 9 months ago
- ☆10 Updated 9 months ago
- ☆16 Updated 9 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆26 Updated 11 months ago
- FlexiFilm: Long Video Generation with Flexible Conditions ☆32 Updated 11 months ago
- Official Implementation for "Block and Detail: Scaffolding Sketch-to-Image Generation" ☆26 Updated 5 months ago
- Official code for "Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing" ☆36 Updated 3 weeks ago
- UniCon: A Simple Approach to Unifying Diffusion-based Conditional Generation (ICLR 2025) ☆24 Updated last month
- Official implementation of "Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion" [ACCV 2024] ☆17 Updated 4 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆34 Updated last year
- ☆20 Updated 7 months ago
- ☆26 Updated last month
- ☆9 Updated last year
- ☆11 Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement" ☆45 Updated 4 months ago
- 👆 PyTorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion" ☆25 Updated 5 months ago
- ☆22 Updated 5 months ago
- A curated list of papers and resources for text-to-image evaluation. ☆29 Updated last year