erwold / qwen2vl-flux
β424Updated 2 months ago
Alternatives and similar repositories for qwen2vl-flux:
Users that are interested in qwen2vl-flux are comparing it to the libraries listed below
- Official implementation of OneDiffusion paperβ596Updated 2 months ago
- Training-free Regional Prompting for Diffusion Transformers π₯β554Updated 2 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretrainiβ¦β543Updated 6 months ago
- Enhance-A-Video: Better Generated Video for Freeβ390Updated this week
- text to image to generation: CogView3-Plus and CogView3(ECCV 2024)β283Updated last month
- All-round Creator and Editorβ186Updated last month
- β374Updated 3 months ago
- Memory-optimized training scripts for video models based on Diffusersβ860Updated this week
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Imageβ¦β284Updated 2 months ago
- Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"β440Updated 3 weeks ago
- β587Updated 2 months ago
- A pipeline parallel training script for diffusion models.β518Updated this week
- Illumination Drawing Tools for Text-to-Image Diffusion Modelsβ537Updated last month
- SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.β464Updated this week
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generationβ504Updated 5 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!β435Updated 2 months ago
- β482Updated last month
- β208Updated 6 months ago
- πΉ A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.β644Updated 2 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2β285Updated 2 weeks ago
- Multimodal Models in Real Worldβ437Updated 3 months ago
- β406Updated 5 months ago
- π₯ CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Modelsβ203Updated 7 months ago
- NeurIPS 2024β352Updated 4 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequencesβ273Updated 6 months ago
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Modelβ236Updated 6 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidanceβ210Updated last week
- β416Updated 10 months ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generationβ215Updated 7 months ago