Zeqiang-Lai / Anything2Image
Generate images from anything with ImageBind and Stable Diffusion
☆193 · Updated last year
Related projects
Alternatives and complementary repositories for Anything2Image
- BindDiffusion: One Diffusion Model to Bind Them All ☆162 · Updated last year
- ☆145 · Updated 2 months ago
- Retrieval-Augmented Video Generation for Telling a Story ☆250 · Updated 9 months ago
- Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA ☆176 · Updated 11 months ago
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models ☆302 · Updated 10 months ago
- Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing ☆227 · Updated last year
- ☆141 · Updated 4 months ago
- ☆166 · Updated 4 months ago
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing ☆91 · Updated 2 weeks ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter ☆387 · Updated 4 months ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models ☆343 · Updated last year
- [IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance ☆185 · Updated 8 months ago
- Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning ☆284 · Updated 4 months ago
- [ICLR 2024] Github Repo for "HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion" ☆493 · Updated last year
- Multimodal Models in Real World ☆404 · Updated 3 weeks ago
- Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts ☆320 · Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation ☆456 · Updated this week
- Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing" ☆221 · Updated last year
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation ☆396 · Updated 7 months ago
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation" ☆784 · Updated last year
- [ICCV 2023] Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation ☆265 · Updated last year
- VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024 ☆261 · Updated 7 months ago
- [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing". ☆311 · Updated 5 months ago
- Code for Text2Performer. Paper: Text2Performer: Text-Driven Human Video Generation ☆323 · Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models ☆490 · Updated 10 months ago
- ☆81 · Updated last year
- The HD-VG-130M Dataset ☆109 · Updated 7 months ago
- Official Implementation of FreeDrag (CVPR 2024) ☆413 · Updated 6 months ago
- [SIGGRAPH Asia 2023] An interactive story visualization tool that supports multiple characters ☆254 · Updated 7 months ago
- ICLR 2024 (Spotlight) ☆726 · Updated 8 months ago