text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
☆12,449 · Nov 4, 2025 · Updated 3 months ago
Alternatives and similar repositories for CogVideo
Users who are interested in CogVideo are comparing it to the libraries listed below.
- Open-Sora: Democratizing Efficient Video Production for All ☆28,604 · Apr 30, 2025 · Updated 10 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆11,780 · Nov 21, 2025 · Updated 3 months ago
- Official implementation of AnimateDiff. ☆12,038 · Jul 31, 2024 · Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models ☆5,032 · Jan 9, 2026 · Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆2,250 · Mar 6, 2025 · Updated 11 months ago
- The best OSS video generation models, created by Genmo ☆3,604 · Nov 14, 2025 · Updated 3 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆3,161 · Dec 21, 2024 · Updated last year
- Official repository for LTX-Video ☆9,367 · Jan 5, 2026 · Updated last month
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,996 · Sep 8, 2024 · Updated last year
- Official inference repo for FLUX.1 models ☆25,225 · Jul 31, 2025 · Updated 7 months ago
- This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project. ☆12,134 · Oct 29, 2025 · Updated 4 months ago
- Enjoy the magic of Diffusion models! ☆11,826 · Feb 15, 2026 · Updated 2 weeks ago
- Wan: Open and Advanced Large-Scale Video Generative Models ☆15,434 · Dec 15, 2025 · Updated 2 months ago
- Scalable and memory-optimized training of diffusion models ☆1,338 · Jun 4, 2025 · Updated 8 months ago
- A curated list of recent diffusion models for video generation, editing, and various other applications. ☆5,466 · Updated this week
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images. ☆1,912 · Updated this week
- Generative Models by Stability AI ☆26,930 · Dec 16, 2025 · Updated 2 months ago
- VideoSys: An easy and efficient system for video generation ☆2,016 · Aug 27, 2025 · Updated 6 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch. ☆32,873 · Updated this week
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models ☆3,153 · Jan 10, 2025 · Updated last year
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding ☆4,293 · Nov 27, 2025 · Updated 3 months ago
- More relighting! ☆8,375 · Feb 20, 2025 · Updated last year
- Bring portraits to life! ☆17,833 · Nov 16, 2025 · Updated 3 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper ☆6,387 · Sep 26, 2024 · Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt. ☆6,471 · Jun 28, 2024 · Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,251 · Feb 16, 2025 · Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,281 · Oct 31, 2024 · Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆8,382 · May 31, 2024 · Updated last year
- Let us control diffusion models! ☆33,663 · Feb 25, 2024 · Updated 2 years ago
- A unified inference and post-training framework for accelerated video generation. ☆3,111 · Updated this week
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. ☆1,918 · Oct 30, 2025 · Updated 4 months ago
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators ☆4,244 · May 6, 2023 · Updated 2 years ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,633 · Sep 25, 2024 · Updated last year
- Kolors Team ☆4,597 · Nov 13, 2024 · Updated last year
- Next-Token Prediction is All You Need ☆2,350 · Jan 12, 2026 · Updated last month
- MAGI-1: Autoregressive Video Generation at Scale ☆3,643 · Jun 17, 2025 · Updated 8 months ago
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone ☆23,942 · Updated this week
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆4,969 · Updated this week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆2,530 · Nov 18, 2025 · Updated 3 months ago