ai-forever / KandinskyVideoLinks
KandinskyVideo — multilingual end-to-end text2video latent diffusion model
☆184Updated last year
Alternatives and similar repositories for KandinskyVideo
Users that are interested in KandinskyVideo are comparing it to the libraries listed below
Sorting:
- [TOG 2024]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter☆262Updated 8 months ago
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First☆113Updated 6 months ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆234Updated last year
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆175Updated 2 years ago
- AnimateDiff I2V version.☆186Updated last year
- Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"☆265Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆162Updated last year
- Keyframe Interpolation with CogvideoX☆139Updated last year
- [NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance☆131Updated last year
- Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2…☆86Updated 2 years ago
- Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'☆256Updated 2 years ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆149Updated 11 months ago
- ☆143Updated last year
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆103Updated last year
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆99Updated last year
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models☆133Updated 9 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024]☆255Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆131Updated last year
- Retrieval-Augmented Video Generation for Telling a Story☆259Updated last year
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥☆260Updated 11 months ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆178Updated last year
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆420Updated 3 months ago
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models☆205Updated last year
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆244Updated last year
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)☆138Updated 7 months ago
- RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]☆313Updated 9 months ago
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models☆292Updated 6 months ago
- [ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745☆250Updated 7 months ago
- [SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters☆268Updated last year
- Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"☆349Updated last year