Visual-AI / SCD
The official repository for CVPRW2024 paper "What’s in a Name? Beyond Class Indices for Image Recognition"
☆12 Updated 8 months ago
Alternatives and similar repositories for SCD:
Users who are interested in SCD are comparing it to the repositories listed below:
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis" ☆19 Updated 3 months ago
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models ☆28 Updated 3 months ago
- ☆28 Updated last year
- ☆14 Updated last year
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks. ☆15 Updated 8 months ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024] ☆23 Updated 9 months ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024] ☆33 Updated last year
- Codebase for the paper "HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View" ☆13 Updated 11 months ago
- ☆17 Updated 10 months ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation ☆23 Updated 10 months ago
- A curated list of papers and resources for text-to-image evaluation. ☆29 Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark ☆16 Updated 2 weeks ago
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu… ☆13 Updated last year
- ☆10 Updated 9 months ago
- Official pytorch implementation for SingleInsert ☆26 Updated last year
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance ☆21 Updated 9 months ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners" ☆28 Updated last year
- ☆21 Updated last year
- ☆11 Updated last year
- ☆9 Updated last year
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation ☆13 Updated this week
- ☆25 Updated last month
- Official Implementation for "Block and Detail: Scaffolding Sketch-to-Image Generation" ☆29 Updated 6 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 Updated 9 months ago
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation" ☆38 Updated this week
- FlexiFilm: Long Video Generation with Flexible Conditions ☆32 Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 Updated last year
- ☆20 Updated 7 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach' ☆17 Updated 7 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ☆30 Updated 5 months ago