Zeqiang-Lai / Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
☆314Updated last year
Alternatives and similar repositories for Mini-DALLE3
Users that are interested in Mini-DALLE3 are comparing it to the libraries listed below
Sorting:
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆150Updated 5 months ago
- [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".☆355Updated 2 months ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆407Updated 10 months ago
- ☆200Updated last year
- NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models☆419Updated last year
- Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing☆227Updated last year
- ☆466Updated 8 months ago
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆511Updated last year
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆354Updated last year
- Multimodal Models in Real World☆503Updated 2 months ago
- ☆228Updated last year
- ☆176Updated 10 months ago
- Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning☆298Updated 10 months ago
- Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts☆325Updated last year
- ICLR 2024 (Spotlight)☆768Updated last year
- Put Your Face Everywhere in Seconds.☆312Updated last year
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆491Updated 10 months ago
- Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"☆351Updated last year
- [TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥☆213Updated 3 weeks ago
- UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization☆248Updated 2 weeks ago
- DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework☆342Updated 5 months ago
- [CVPR 2024] Official implementation of FreeDrag: Feature Dragging for Reliable Point-based Image Editing☆417Updated last month
- [TOG 2024]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter☆235Updated last month
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆136Updated 3 months ago
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥☆258Updated 4 months ago
- Retrieval-Augmented Video Generation for Telling a Story☆256Updated last year
- ☆375Updated 11 months ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆226Updated 10 months ago
- ☆143Updated 10 months ago
- Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence☆390Updated 5 months ago