kandinskylab / kandinsky-5Links
Kandinsky 5.0: A family of diffusion models for Video & Image generation
☆579Updated last week
Alternatives and similar repositories for kandinsky-5
Users that are interested in kandinsky-5 are comparing it to the libraries listed below
Sorting:
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆663Updated 2 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆668Updated 3 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆398Updated 3 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆423Updated last month
- ☆381Updated 5 months ago
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆657Updated 3 weeks ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆539Updated last month
- Text and image to video generation: Kandinsky 4.0 (2024)☆149Updated last year
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆624Updated last month
- Scale-wise Distillation of Diffusion Models☆113Updated 3 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆208Updated last month
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".☆194Updated 8 months ago
- Taming large-scale full-parameter few-step training with self-adversarial flows! 👏🏻☆310Updated last week
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space☆313Updated 2 months ago
- [ICML2025] An 8-step inversion and 8-step editing process works effectively with the FLUX-dev model. (3x speedup with results that are co…☆285Updated 7 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆454Updated 9 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆270Updated 6 months ago
- Tiny AutoEncoder for Hunyuan Video (and other video models)☆250Updated last week
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆361Updated 7 months ago
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi…☆318Updated 4 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆181Updated 5 months ago
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆445Updated 3 weeks ago
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆200Updated 5 months ago
- [CVPR 2025] Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models☆337Updated last month
- ☆172Updated 3 months ago
- [ICCV 2025] LayerAnimate: Layer-specific Control for Animation☆193Updated 4 months ago
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation☆177Updated 3 weeks ago
- Inference-time scaling of diffusion-based image and video generation models.☆172Updated last week
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆238Updated 4 months ago
- Calligrapher: Freestyle Text Image Customization☆295Updated 3 months ago