Visual-AI / SCDLinks
The official repository for CVPRW2024 paper "What’s in a Name? Beyond Class Indices for Image Recognition"
☆12Updated 9 months ago
Alternatives and similar repositories for SCD
Users that are interested in SCD are comparing it to the libraries listed below
Sorting:
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 5 months ago
- ☆30Updated last year
- ☆14Updated last year
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆29Updated last month
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆23Updated 11 months ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆15Updated 9 months ago
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu…☆13Updated last year
- DreamCube: 3D Panorama Generation via Multi-plane Synchronization☆58Updated this week
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance☆21Updated 11 months ago
- Official pytorch implementation for SingleInsert☆27Updated last year
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]☆33Updated last year
- Fast Sprite Decomposition from Animated Graphics [ECCV2024]☆32Updated 9 months ago
- ☆17Updated last year
- FlexiFilm: Long Video Generation with Flexible Conditions☆31Updated last year
- A curated list of papers and resources for text-to-image evaluation.☆29Updated last year
- ☆20Updated 9 months ago
- ☆23Updated 7 months ago
- ☆21Updated 2 weeks ago
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Updated last year
- ☆10Updated 11 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆20Updated 2 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- The repo for: TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis☆16Updated 6 months ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Updated last year
- Fine-tune of Florence-2 for shot categorization.☆24Updated 3 months ago
- ☆22Updated last month
- Code for full fintuing Mochi model with FSDP (and CP)☆24Updated 2 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 10 months ago
- ☆28Updated 3 months ago
- [ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion☆21Updated 11 months ago