ai-forever / KandinskyVideo
KandinskyVideo — multilingual end-to-end text2video latent diffusion model
☆185Updated 11 months ago
Alternatives and similar repositories for KandinskyVideo:
Users that are interested in KandinskyVideo are comparing it to the libraries listed below
- Text and image to video generation: Kandinsky 4.0 (2024)☆144Updated 4 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆242Updated 10 months ago
- [TOG 2024]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter☆235Updated last month
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models☆206Updated last year
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First☆101Updated last month
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆158Updated last year
- ☆143Updated 10 months ago
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models☆268Updated 3 weeks ago
- Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""☆174Updated last year
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling☆162Updated 7 months ago
- AnimateDiff I2V version.☆186Updated last year
- [CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts☆296Updated 10 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆299Updated 3 months ago
- Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)☆220Updated 3 months ago
- Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"☆251Updated 6 months ago
- Official implementations for paper: LivePhoto: Real Image Animation with Text-guided Motion Control☆188Updated last year
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆96Updated 5 months ago
- Keyframe Interpolation with CogvideoX☆127Updated 6 months ago
- Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2…☆86Updated 2 years ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆116Updated 3 months ago
- [NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance☆125Updated 6 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆99Updated 10 months ago
- ☆110Updated last year
- Controlnet extension of AnimateDiff.☆52Updated last year
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆225Updated 9 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆273Updated last month
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆406Updated 9 months ago
- ☆272Updated 9 months ago
- ☆452Updated last year