Finetuning and inference tools for the CogView4 and CogVideoX model series.
☆127 · May 14, 2025 · Updated last year
Alternatives and similar repositories for CogKit
Users that are interested in CogKit are comparing it to the libraries listed below.
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach ☆13 · Apr 2, 2025 · Updated last year
- Code of the paper "FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi… ☆29 · Apr 3, 2026 · Updated last month
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆66 · May 7, 2025 · Updated last year
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation ☆64 · Jul 31, 2025 · Updated 9 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151 ☆90 · May 12, 2025 · Updated last year
- Scalable and memory-optimized training of diffusion models ☆1,359 · Apr 8, 2026 · Updated last month
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory ☆67 · Jan 13, 2026 · Updated 4 months ago
- Benchmark dataset and code of MSRVTT-Personalization ☆52 · Nov 10, 2025 · Updated 6 months ago
- [AAAI-2026] FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation ☆459 · Mar 5, 2025 · Updated last year
- ☆18 · Oct 24, 2024 · Updated last year
- CogView4, CogView3-Plus and CogView3 (ECCV 2024) ☆1,103 · Mar 29, 2025 · Updated last year
- Simple Controlnet module for CogvideoX model. ☆182 · Jan 12, 2025 · Updated last year
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation ☆1,624 · Mar 23, 2026 · Updated last month
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation ☆268 · Jan 30, 2025 · Updated last year
- ☆30 · Aug 21, 2025 · Updated 9 months ago
- Physics-based rigging with MPM for realistic character animation. ICCV 2025. ☆86 · Apr 15, 2026 · Updated last month
- Let's finetune video generation models! ☆545 · Sep 15, 2025 · Updated 8 months ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers ☆41 · Jul 23, 2025 · Updated 9 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling ☆24 · May 14, 2026 · Updated last week
- Official repository for "PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms" ☆41 · Dec 18, 2025 · Updated 5 months ago
- [ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity ☆31 · Jul 14, 2025 · Updated 10 months ago
- Official repo of "Guide3D: Create 3D Avatars from Text and Image Guidance" ☆39 · Aug 23, 2023 · Updated 2 years ago
- TPDiff: Temporal Pyramid Video Diffusion Model ☆25 · Mar 13, 2025 · Updated last year
- This is a LoRA model finetuned on Wan-I2V-14B-480P. It turns things in the image into fluffy toys. ☆19 · Nov 10, 2025 · Updated 6 months ago
- [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step ☆352 · Jul 4, 2025 · Updated 10 months ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image ☆22 · Sep 15, 2025 · Updated 8 months ago
- ☆652 · May 24, 2024 · Updated last year
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi… ☆248 · Mar 19, 2025 · Updated last year
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers ☆586 · Jun 5, 2025 · Updated 11 months ago
- Video Diffusion Transformers are In-Context Learners ☆36 · Jan 6, 2025 · Updated last year
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes ☆110 · Mar 19, 2025 · Updated last year
- ☆10 · Nov 18, 2024 · Updated last year
- ☆52 · Dec 13, 2024 · Updated last year
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation ☆874 · Dec 23, 2025 · Updated 4 months ago
- [ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation ☆119 · Apr 21, 2026 · Updated last month
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory) ☆30 · Jan 22, 2026 · Updated 3 months ago
- ☆25 · Jul 20, 2025 · Updated 10 months ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention ☆667 · Mar 6, 2026 · Updated 2 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆648 · Oct 29, 2025 · Updated 6 months ago