mindspore-lab / mindoneLinks
one for all, Optimal generator with No Exception
☆446Updated this week
Alternatives and similar repositories for mindone
Users that are interested in mindone are comparing it to the libraries listed below
Sorting:
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆309Updated last year
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆226Updated 9 months ago
- A collection of diffusion models based on MindSpore☆163Updated last year
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆193Updated 5 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆290Updated 2 weeks ago
- A reading list of video generation☆610Updated this week
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆553Updated last year
- Official code for ICCV 205 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distilla…☆78Updated last month
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆422Updated 9 months ago
- ☆472Updated last month
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆33Updated 5 months ago
- [CVPR 2024] DeepCache: Accelerating Diffusion Models for Free☆921Updated last year
- An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation☆580Updated this week
- An initiative to replicate Sora☆104Updated last year
- Official code of SmartEdit [CVPR-2024 Highlight]☆349Updated last year
- 📚 Collection of awesome generation acceleration resources.☆311Updated last month
- GenEval: An object-focused framework for evaluating text-to-image alignment☆335Updated 5 months ago
- A list for Text-to-Video, Image-to-Video works☆242Updated 2 months ago
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆293Updated 4 months ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models☆251Updated 8 months ago
- StyleShot: A SnapShot on Any Style. 一款可以迁移任意风格到任意内容的模型,无需针对图片微调,即能生成高质量的个性风格化图片!☆426Updated last month
- Multimodal Models in Real World☆533Updated 5 months ago
- ☆182Updated last year
- [ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching☆170Updated 5 months ago
- ☆360Updated 9 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆302Updated 6 months ago
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…☆325Updated last month
- 扩散模型算法基础文档、训练、实验、部署等仓库☆40Updated 5 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆173Updated 10 months ago
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆110Updated 4 months ago