mindspore-lab / mindone
one for all, Optimal generator with No Exception
β367Updated this week
Related projects β
Alternatives and complementary repositories for mindone
- Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of π€Huggingface.β705Updated this week
- A collection of diffusion models based on MindSporeβ159Updated 9 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generationβ206Updated 2 weeks ago
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]β272Updated 6 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Modelsβ591Updated 2 weeks ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generationβ580Updated 2 weeks ago
- A collection of awesome text-to-image generation studies.β430Updated this week
- xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelismβ714Updated this week
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.β32Updated 8 months ago
- β100Updated 2 months ago
- A collection of awesome video generation studies.β349Updated this week
- Official code of SmartEdit [CVPR-2024 Highlight]β256Updated 5 months ago
- β349Updated last month
- Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidanceβ180Updated 3 months ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Modelsβ207Updated last month
- β99Updated 8 months ago
- A reading list of video generationβ419Updated this week
- MindSpore online courses: Step into LLMβ430Updated 3 weeks ago
- π₯π₯First-ever hour scale video understanding modelsβ166Updated 3 weeks ago
- π₯π₯π₯ A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).β363Updated last week
- An initiative to replicate Soraβ99Updated 7 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachersβ527Updated 3 weeks ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Modelβ389Updated last week
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"β244Updated last month
- β254Updated 3 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Modelsβ143Updated last month
- A toolbox of vision models and algorithms based on MindSporeβ238Updated 3 weeks ago
- Let's finetune video generation models!β241Updated this week
- π This is a repository for organizing papers, codes and other resources related to unified multimodal models.β215Updated 2 weeks ago
- Official implementation of FouriScale (ECCV2024)β137Updated 3 months ago