kandinskylab / kandinsky-5Links
Kandinsky 5.0: A family of diffusion models for Video & Image generation
☆682Updated 2 weeks ago
Alternatives and similar repositories for kandinsky-5
Users that are interested in kandinsky-5 are comparing it to the libraries listed below
Sorting:
- Pusa: Thousands Timesteps Video Diffusion Model☆671Updated 4 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆669Updated 3 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆552Updated 2 months ago
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆684Updated last month
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆200Updated 6 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆407Updated 4 months ago
- Taming large-scale few-step training with self-adversarial flows! 👏🏻☆438Updated this week
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆269Updated 7 months ago
- ☆382Updated 6 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".☆193Updated 9 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆210Updated 2 months ago
- Scale-wise Distillation of Diffusion Models☆113Updated 3 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆449Updated 2 months ago
- Tiny AutoEncoder for Hunyuan Video (and other video models)☆263Updated 3 weeks ago
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space☆327Updated 3 months ago
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation☆183Updated last month
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi…☆322Updated 4 months ago
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆649Updated last month
- ☆368Updated 9 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆454Updated 10 months ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆149Updated last year
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆863Updated 4 months ago
- ☆554Updated 3 weeks ago
- Calligrapher: Freestyle Text Image Customization☆294Updated 4 months ago
- Wan2.2-Lightning: Speed up wan2.2 model with distillation☆254Updated 2 months ago
- Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)☆893Updated 6 months ago
- Community trainer for Lightricks' LTX Video model 🎬 ⚡️☆385Updated last week
- [ICML2025] An 8-step inversion and 8-step editing process works effectively with the FLUX-dev model. (3x speedup with results that are co…☆287Updated 8 months ago
- Qwen-Image text to image lora trainer☆672Updated 3 weeks ago
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆365Updated 7 months ago