erwold / qwen2vl-flux
★350 Updated 3 weeks ago
Alternatives and similar repositories for qwen2vl-flux:
Users who are interested in qwen2vl-flux are comparing it to the repositories listed below.
- Training-free Regional Prompting for Diffusion Transformers 🔥 ★450 Updated 2 weeks ago
- ★490 Updated this week
- Memory optimized finetuning scripts for CogVideoX & Mochi using TorchAO and DeepSpeed ★513 Updated last week
- Text-to-image generation: CogView3-Plus and CogView3 (ECCV 2024) ★264 Updated 2 months ago
- All-round Creator and Editor ★157 Updated 3 weeks ago
- 🔥 CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models ★198 Updated 5 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ★512 Updated 4 months ago
- ★495 Updated 3 weeks ago
- ★392 Updated 3 months ago
- ★301 Updated last month
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation ★476 Updated 3 months ago
- Multimodal Models in Real World ★415 Updated last month
- A more flexible CogVideoX that can generate videos at any resolution and create videos from images. ★554 Updated this week
- ★260 Updated 4 months ago
- ★183 Updated 4 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2 ★270 Updated last month
- Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation ★414 Updated 2 months ago
- IP Adapter Instruct ★188 Updated 4 months ago
- UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization ★212 Updated last month
- NeurIPS 2024 ★333 Updated 2 months ago
- Implementation of "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation" ★146 Updated this week
- ★275 Updated last week
- ★404 Updated 8 months ago
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model ★396 Updated 6 months ago
- Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance" (NeurIPS 2024) ★265 Updated last week
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model ★234 Updated 4 months ago
- Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models ★239 Updated last month
- Rectified Flow Inversion (RF-Inversion) ★288 Updated last month
- The best OSS video generation models ★126 Updated last month
- [NeurIPS 2024] Boosting the performance of consistency models with PCM! ★374 Updated this week