mulanai / MuLan
MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual capability to any diffusion model without additional training)
☆135 Updated 2 months ago
Alternatives and similar repositories for MuLan:
Users who are interested in MuLan are comparing it to the repositories listed below.
- ☆175 Updated 9 months ago
- Multimodal Models in Real World ☆492 Updated last month
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models ☆71 Updated 9 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines ☆120 Updated 5 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data. ☆228 Updated last month
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort ☆150 Updated 4 months ago
- An initiative to replicate Sora ☆104 Updated last year
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance ☆250 Updated 3 weeks ago
- ☆142 Updated 9 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆204 Updated last week
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization ☆198 Updated last week
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image… ☆303 Updated 3 weeks ago
- ☆220 Updated 8 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2 ☆296 Updated 2 months ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆156 Updated 5 months ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation ☆221 Updated 9 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆99 Updated last month
- UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization ☆245 Updated 5 months ago
- 🔥 CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models ☆209 Updated 9 months ago
- ☆473 Updated 4 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers ☆144 Updated 5 months ago
- ☆227 Updated last year
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. Supports free user input of control… ☆123 Updated 9 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆140 Updated last month
- ☆84 Updated 4 months ago
- ☆198 Updated last year
- GenEval: An object-focused framework for evaluating text-to-image alignment ☆220 Updated last month
- ☆171 Updated last year
- Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes> ☆84 Updated last month
- ☆78 Updated 2 weeks ago