DiT for VAE (and Video Generation)
☆35Sep 2, 2024Updated last year
Alternatives and similar repositories for CascadeV
Users that are interested in CascadeV are comparing it to the libraries listed below
Sorting:
- ☆20Jan 1, 2026Updated 2 months ago
- ☆22Mar 7, 2025Updated 11 months ago
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- ☆11Dec 15, 2025Updated 2 months ago
- The official repository of EffiVED☆19Jun 5, 2024Updated last year
- All tools developed by myself for personal purposes.☆16Feb 1, 2026Updated last month
- MCP prompt tool applying Chain-of-Draft (CoD) reasoning - BYOLLM☆18Sep 8, 2025Updated 5 months ago
- UniVid: The Open-Source Unified Video Model☆30Oct 13, 2025Updated 4 months ago
- 🚀 原生使用 Deepspeed 训练 Diffusers | Native Training of Diffusers with Deepspeed☆19Jan 19, 2025Updated last year
- faster parallel inference of mochi-1 video generation model☆125Feb 25, 2025Updated last year
- re-implementation of instantsplat (unofficial)☆16Aug 5, 2024Updated last year
- Fashion-VDM: Video Diffusion Model for Virtual Try-On☆19Nov 4, 2024Updated last year
- ☆17Feb 20, 2025Updated last year
- Blending Custom Photos with Video Diffusion Transformers☆48Jan 21, 2025Updated last year
- ☆28Mar 4, 2025Updated 11 months ago
- Finetune Stable Video Diffusion with Lora☆19Feb 3, 2024Updated 2 years ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆148Oct 9, 2025Updated 4 months ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 7 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆99Nov 27, 2024Updated last year
- Let's finetune video generation models!☆543Sep 15, 2025Updated 5 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆134Dec 18, 2025Updated 2 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆65Oct 16, 2024Updated last year
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆166Dec 11, 2025Updated 2 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆259Dec 27, 2024Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆162Apr 7, 2024Updated last year
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆176Sep 26, 2024Updated last year
- Train LoRA using Microsoft's official implementation with Stable Diffusion models.☆33May 9, 2023Updated 2 years ago
- The official PyTorch Implementation of Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment☆41Dec 4, 2025Updated 2 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆26Dec 15, 2024Updated last year
- [ECCV24] Official code for RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting☆32Sep 3, 2024Updated last year
- ☆33Jan 6, 2025Updated last year
- EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing☆30Mar 29, 2024Updated last year
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation☆30Mar 29, 2024Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆33Nov 29, 2024Updated last year
- A parallelism VAE avoids OOM for high resolution image generation☆85Aug 4, 2025Updated 6 months ago
- ☆34May 14, 2025Updated 9 months ago
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆566Sep 16, 2024Updated last year