ai-forever / KandinskyVideoLinks
KandinskyVideo — multilingual end-to-end text2video latent diffusion model
☆184Updated last year
Alternatives and similar repositories for KandinskyVideo
Users that are interested in KandinskyVideo are comparing it to the libraries listed below
Sorting:
- [TOG 2024]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter☆240Updated 2 months ago
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models☆174Updated last year
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models☆205Updated last year
- Text and image to video generation: Kandinsky 4.0 (2024)☆145Updated 6 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆244Updated 11 months ago
- AnimateDiff I2V version.☆186Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆158Updated last year
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First☆107Updated 2 weeks ago
- Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"☆256Updated 8 months ago
- ☆143Updated 11 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆99Updated 11 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2…☆86Updated 2 years ago
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆512Updated last year
- Controlnet extension of AnimateDiff.☆53Updated last year
- [CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts☆298Updated last year
- Keyframe Interpolation with CogvideoX☆133Updated 7 months ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆411Updated 11 months ago
- ☆455Updated last year
- CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)☆338Updated 11 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆302Updated 4 months ago
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"☆244Updated last year
- Retrieval-Augmented Video Generation for Telling a Story☆257Updated last year
- Official implementations for paper: LivePhoto: Real Image Animation with Text-guided Motion Control☆189Updated last year
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆229Updated 11 months ago
- implementation of the IPAdapter models for HF Diffusers☆174Updated last year
- Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"☆350Updated last year
- A simple magic animate pipeline including densepose inference.☆37Updated last year
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)☆131Updated 2 months ago
- Create transparent image with Diffusers!☆55Updated 4 months ago