mindspore-lab / mindoneLinks
one for all, Optimal generator with No Exception
☆459Updated last week
Alternatives and similar repositories for mindone
Users that are interested in mindone are comparing it to the libraries listed below
Sorting:
- A reading list of video generation☆628Updated last week
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆228Updated 11 months ago
- ☆283Updated this week
- ☆277Updated 3 months ago
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…☆333Updated last week
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆604Updated last year
- 📚 Collection of awesome generation acceleration resources.☆356Updated 3 months ago
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆34Updated 8 months ago
- [ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers☆316Updated 2 months ago
- ☆192Updated last year
- Multimodal Models in Real World☆549Updated 8 months ago
- A list for Text-to-Video, Image-to-Video works☆243Updated 5 months ago
- A collection of diffusion models based on MindSpore☆161Updated last year
- GenEval: An object-focused framework for evaluating text-to-image alignment☆378Updated 7 months ago
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆310Updated last year
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,285Updated 2 weeks ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆200Updated 8 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆299Updated 3 months ago
- Official code of SmartEdit [CVPR-2024 Highlight]☆359Updated last year
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"☆262Updated this week
- [CVPR 2024] DeepCache: Accelerating Diffusion Models for Free☆935Updated last year
- ☆116Updated 2 years ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆411Updated 4 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆637Updated last year
- Code repository for T2V-Turbo and T2V-Turbo-v2☆303Updated 9 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆423Updated 11 months ago
- An initiative to replicate Sora☆104Updated last year
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆491Updated last year
- ☆471Updated 4 months ago
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…☆85Updated 4 months ago