kandinskylab / kandinsky-5Links
Kandinsky 5.0: A family of diffusion models for Video & Image generation
☆705Updated last week
Alternatives and similar repositories for kandinsky-5
Users that are interested in kandinsky-5 are comparing it to the libraries listed below
Sorting:
- Pusa: Thousands Timesteps Video Diffusion Model☆672Updated last week
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆672Updated 3 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆271Updated 8 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆566Updated 3 months ago
- [ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻☆476Updated 2 weeks ago
- ☆388Updated 7 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".☆195Updated 10 months ago
- [ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆668Updated 2 months ago
- [ICLR'2026] Scale-wise Distillation of Diffusion Models☆113Updated 4 months ago
- [ICML2025] An 8-step inversion and 8-step editing process works effectively with the FLUX-dev model. (3x speedup with results that are co…☆289Updated 9 months ago
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆715Updated 2 months ago
- Tiny AutoEncoder for Hunyuan Video (and other video models)☆297Updated this week
- Calligrapher: Freestyle Text Image Customization☆296Updated 5 months ago
- Qwen-Image text to image lora trainer☆701Updated last month
- Krea Realtime 14B. An open-source realtime AI video model.☆485Updated 3 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆420Updated 5 months ago
- ☆328Updated 4 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆215Updated 3 months ago
- [CVPR 2025] Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models☆354Updated 2 months ago
- Wan2.2-Lightning: Speed up wan2.2 model with distillation☆266Updated 3 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆184Updated 6 months ago
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆200Updated 7 months ago
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆449Updated 2 months ago
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space☆343Updated 4 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆456Updated 11 months ago
- ☆370Updated 10 months ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆149Updated last year
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi…☆322Updated 5 months ago
- Community trainer for Lightricks' LTX Video model 🎬 ⚡️☆401Updated last month
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos☆425Updated this week