Adamdad / Awesome-ComposableAI
A curated list of Composable AI methods: building AI systems by composing modules.
☆197 Updated 2 years ago
Alternatives and similar repositories for Awesome-ComposableAI
Users who are interested in Awesome-ComposableAI are comparing it to the repositories listed below.
- Generate image from anything with ImageBind and Stable Diffusion ☆201 Updated 2 years ago
- General video interaction platform based on LLMs, including Video ChatGPT ☆254 Updated 2 years ago
- BindDiffusion: One Diffusion Model to Bind Them All ☆164 Updated 2 years ago
- ☆180 Updated 2 months ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models ☆356 Updated 2 years ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆413 Updated last year
- Open reproduction of MUSE for fast text2image generation. ☆359 Updated last year
- Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image. ☆455 Updated 2 years ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis ☆320 Updated 2 years ago
- [CVPR 2025] Official PyTorch implementation of StoryGPT-V ☆40 Updated 7 months ago
- Image Editing Anything ☆116 Updated 2 years ago
- An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal … ☆364 Updated 2 years ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024 ☆757 Updated 2 years ago
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models ☆313 Updated 2 years ago
- Unofficial implementation of Tune-A-Video ☆193 Updated 3 years ago
- Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing ☆226 Updated 2 years ago
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention ☆712 Updated last year
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions. ☆441 Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023 ☆294 Updated 2 years ago
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models ☆280 Updated last year
- ☆93 Updated 2 years ago
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images ☆506 Updated 3 months ago
- ☆82 Updated 2 years ago
- ☆148 Updated 2 years ago
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor ☆521 Updated last year
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model ☆106 Updated 10 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024] ☆176 Updated last month
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning". ☆298 Updated 5 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text… ☆120 Updated 2 years ago
- Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning ☆314 Updated last year