Finetuning and inference tools for the CogView4 and CogVideoX model series.
☆123May 14, 2025Updated 10 months ago
Alternatives and similar repositories for CogKit
Users that are interested in CogKit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated last year
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆29Apr 3, 2026Updated last week
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆65May 7, 2025Updated 11 months ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆64Jul 31, 2025Updated 8 months ago
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory☆63Jan 13, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Scalable and memory-optimized training of diffusion models☆1,348Apr 2, 2026Updated last week
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆90May 12, 2025Updated 10 months ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 5 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆458Mar 5, 2025Updated last year
- ☆18Oct 24, 2024Updated last year
- CogView4, CogView3-Plus and CogView3(ECCV 2024)☆1,103Mar 29, 2025Updated last year
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,562Mar 23, 2026Updated 2 weeks ago
- Simple Controlnet module for CogvideoX model.☆181Jan 12, 2025Updated last year
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation☆266Jan 30, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆30Aug 21, 2025Updated 7 months ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 8 months ago
- Physics-based rigging with MPM for realistic character animation. ICCV 2025.☆84Mar 27, 2026Updated 2 weeks ago
- Let's finetune video generation models!☆547Sep 15, 2025Updated 6 months ago
- Official repository for "PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms"☆37Dec 18, 2025Updated 3 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆22Mar 25, 2026Updated 2 weeks ago
- [ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity☆31Jul 14, 2025Updated 8 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- Official repo of "Guide3D: Create 3D Avatars from Text and Image Guidance"☆39Aug 23, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This is a LoRA model finetuned on Wan-I2V-14B-480P. It turns things in the image into fluffy toys.☆19Nov 10, 2025Updated 5 months ago
- [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step☆352Jul 4, 2025Updated 9 months ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image☆22Sep 15, 2025Updated 6 months ago
- ☆645May 24, 2024Updated last year
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi…☆242Mar 19, 2025Updated last year
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆586Jun 5, 2025Updated 10 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆862Dec 23, 2025Updated 3 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆109Mar 19, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Nov 18, 2024Updated last year
- ☆52Dec 13, 2024Updated last year
- [ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation☆114Feb 6, 2026Updated 2 months ago
- ☆35Dec 20, 2023Updated 2 years ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆30Jan 22, 2026Updated 2 months ago
- ☆23Jul 20, 2025Updated 8 months ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆655Mar 6, 2026Updated last month