ExponentialML / Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
★687 · Updated last year
Alternatives and similar repositories for Text-To-Video-Finetuning
Users who are interested in Text-To-Video-Finetuning are comparing it to the libraries listed below.
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation" ★834 · Updated last year
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability ★939 · Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024 ★753 · Updated last year
- Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models" ★397 · Updated 2 years ago
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models ★514 · Updated last year
- Unofficial implementation of [StyleDrop](https://arxiv.org/abs/2306.00983) ★583 · Updated last year
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction ★941 · Updated 8 months ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models ★934 · Updated 8 months ago
- The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising". ★298 · Updated last year
- [ICCV 2023] Consistent Image Synthesis and Editing ★802 · Updated 10 months ago
- Transfer the ControlNet with any basemodel in diffusers 🔥 ★833 · Updated 2 years ago
- Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion" ★1,008 · Updated last year
- Video-P2P: Video Editing with Cross-attention Control ★413 · Updated 2 weeks ago
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information ★603 · Updated 11 months ago
- ✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL ★1,103 · Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation ★487 · Updated 8 months ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" … ★1,042 · Updated last year
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing" ★1,148 · Updated last year
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral) ★539 · Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention ★706 · Updated 6 months ago
- [SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data ★644 · Updated 8 months ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models ★650 · Updated last year
- ICLR 2024 (Spotlight) ★772 · Updated last year
- Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach ★466 · Updated last year
- ★452 · Updated 2 months ago
- Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts ★324 · Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor ★516 · Updated last year
- ★468 · Updated 3 weeks ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral) ★889 · Updated 2 years ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter ★413 · Updated last year