THUDM / CogKit
Finetuning and inference tools for the CogView4 and CogVideoX model series.
☆58Updated last week
Alternatives and similar repositories for CogKit
Users that are interested in CogKit are comparing it to the libraries listed below
Sorting:
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆116Updated 4 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆41Updated last week
- [ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆107Updated last month
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆59Updated 2 months ago
- ☆26Updated last month
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆64Updated last month
- Blending Custom Photos with Video Diffusion Transformers☆46Updated 3 months ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with…☆124Updated 7 months ago
- [Arxiv 2024] Edicho: Consistent Image Editing in the Wild☆118Updated 4 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline☆105Updated 3 weeks ago
- ☆67Updated 11 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆130Updated 7 months ago
- [AAAI-2025] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆91Updated 9 months ago
- ☆103Updated 10 months ago
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning☆86Updated last month
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Updated last year
- Subjects200K dataset☆110Updated 3 months ago
- ☆47Updated 5 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]☆109Updated 3 months ago
- Official repo for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆108Updated last week
- ☆61Updated 5 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆58Updated this week
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆68Updated 4 months ago
- [CVPR2024] Official code for Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation☆86Updated last year
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling☆163Updated 7 months ago
- Code for FreeScale, a tuning-free method for higher-resolution visual generation☆126Updated 2 months ago
- ☆95Updated 10 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)☆114Updated 9 months ago
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing☆89Updated last month
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆55Updated 8 months ago