THUDM / CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
☆10,303Updated this week
Alternatives and similar repositories for CogVideo:
Users that are interested in CogVideo are comparing it to the libraries listed below
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆5,547Updated 6 months ago
- Official implementation of AnimateDiff.☆10,863Updated 5 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,313Updated 6 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,116Updated 3 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆23,110Updated 3 weeks ago
- Kolors Team☆4,108Updated 2 months ago
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators☆4,104Updated last year
- Various AI scripts. Mostly Stable Diffusion stuff.☆3,817Updated 2 weeks ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆13,445Updated this week
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆3,805Updated this week
- Bring portraits to life!☆13,655Updated 2 weeks ago
- Official inference repo for FLUX.1 models☆19,466Updated last week
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆4,654Updated 6 months ago
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,701Updated 3 weeks ago
- Generative Models by Stability AI☆25,088Updated 4 months ago
- More relighting!☆7,348Updated last month
- Enjoy the magic of Diffusion models!☆6,742Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.☆27,123Updated this week
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,479Updated 6 months ago
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)☆3,350Updated 10 months ago
- The best OSS video generation models☆2,718Updated last week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,092Updated 3 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆1,680Updated this week
- High-Resolution Image Synthesis with Latent Diffusion Models☆12,225Updated 10 months ago
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆9,865Updated last month
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆2,982Updated last month
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,481Updated last month
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆2,931Updated 2 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,717Updated 4 months ago
- [NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image☆3,201Updated 3 weeks ago