Zeqiang-Lai / Anything2ImageLinks

Generate image from anything with ImageBind and Stable Diffusion

☆200

Alternatives and similar repositories for Anything2Image

Users that are interested in Anything2Image are comparing it to the libraries listed below

Sorting:

sail-sg / BindDiffusion
BindDiffusion: One Diffusion Model to Bind Them All
☆164Updated 2 years ago
Zeqiang-Lai / Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
☆313Updated last year
aim-uofa / AutoStory
[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
☆151Updated last year
ExponentialML / Video-BLIP2-Preprocessor
A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
☆141Updated last year
fabawi / ImageBind-LoRA
Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA
☆193Updated last year
baaivision / vid2vid-zero
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
☆357Updated 2 years ago
mshukor / UnIVAL
[TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.
☆232Updated last year
md-mohaiminul / VideoRecap
☆200Updated last year
JiauZhang / DragDiffusion
Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
☆226Updated 2 years ago
JourneyDB / JourneyDB
☆180Updated 2 weeks ago
AILab-CVC / Animate-A-Story
Retrieval-Augmented Video Generation for Telling a Story
☆259Updated last year
feizc / IEA
Image Editing Anything
☆116Updated 2 years ago
kaleido-lab / dolphin
General video interaction platform based on LLMs, including Video ChatGPT
☆254Updated 2 years ago
SHI-Labs / VCoder
[CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models
☆280Updated last year
mulanai / MuLan
MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)
☆143Updated 10 months ago
OPPO-Mente-Lab / Edit_Everything
☆92Updated 2 years ago
showlab / ShowAnything
☆82Updated 2 years ago
Yuxinn-J / Scenimefy
[ICCV 2023] Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation
☆286Updated 9 months ago
Vision-CAIR / ChatCaptioner
Official Repository of ChatCaptioner
☆467Updated 2 years ago
JiauZhang / Text2Video-Zero
Implementation of Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
☆87Updated 2 years ago
Deaddawn / DreamFrame-code
☆184Updated 3 months ago
OPPO-Mente-Lab / Subject-Diffusion
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
☆313Updated last year
invictus717 / InteractiveVideo
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
☆131Updated last year
LPengYang / FreeDrag
[CVPR 2024] Official implementation of FreeDrag: Feature Dragging for Reliable Point-based Image Editing
☆420Updated 7 months ago
xiaoqian-shen / StoryGPT-V
[CVPR 2025] Official PyTorch implementation of StoryGPT-V
☆40Updated 5 months ago
VPGTrans / VPGTrans
Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.
☆271Updated 2 years ago
salesforce / GlueGen
☆65Updated 5 months ago
G-U-N / Gen-L-Video
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
☆304Updated last month
yukw777 / EILEV
EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties
☆131Updated last year
Zhendong-Wang / Prompt-Diffusion
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
☆411Updated last year