THUDM / CogKitLinks
Finetuning and inference tools for the CogView4 and CogVideoX model series.
☆70Updated 3 weeks ago
Alternatives and similar repositories for CogKit
Users that are interested in CogKit are comparing it to the libraries listed below
Sorting:
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆117Updated 4 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆47Updated 3 weeks ago
- Blending Custom Photos with Video Diffusion Transformers☆47Updated 4 months ago
- [AAAI-2025] Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆93Updated 10 months ago
- Subjects200K dataset☆111Updated 4 months ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with…☆126Updated 7 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆61Updated 2 weeks ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆132Updated 7 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆343Updated this week
- [Arxiv 2024] Edicho: Consistent Image Editing in the Wild☆118Updated 4 months ago
- [ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆114Updated 2 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling☆169Updated 8 months ago
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing☆90Updated 2 months ago
- ☆83Updated last year
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving☆183Updated last week
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆69Updated 5 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆103Updated last year
- ☆50Updated 5 months ago
- [IJCAI 2025] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models"…☆87Updated 3 weeks ago
- MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance☆120Updated last month
- [CVPR2024] Official code for Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation☆87Updated last year
- EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆58Updated 2 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆55Updated last month
- ☆102Updated 11 months ago
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆55Updated 8 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆62Updated 3 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Updated last year
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆65Updated last month
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆25Updated 5 months ago
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation☆41Updated 2 months ago