DiT for VAE (and Video Generation)
☆35Sep 2, 2024Updated last year
Alternatives and similar repositories for CascadeV
Users that are interested in CascadeV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Mar 7, 2025Updated last year
- ☆20Jan 1, 2026Updated 2 months ago
- UniVid: The Open-Source Unified Video Model☆30Oct 13, 2025Updated 5 months ago
- ☆11Dec 15, 2025Updated 3 months ago
- faster parallel inference of mochi-1 video generation model☆125Feb 25, 2025Updated last year
- re-implementation of instantsplat (unofficial)☆16Aug 5, 2024Updated last year
- The official repository of EffiVED☆19Jun 5, 2024Updated last year
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆149Oct 9, 2025Updated 5 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 8 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆261Dec 27, 2024Updated last year
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆138Dec 18, 2025Updated 3 months ago
- ☆17Feb 20, 2025Updated last year
- Train LoRA using Microsoft's official implementation with Stable Diffusion models.☆33May 9, 2023Updated 2 years ago
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- Fashion-VDM: Video Diffusion Model for Virtual Try-On☆19Nov 4, 2024Updated last year
- An EinSum system in JAX☆18Mar 6, 2026Updated 2 weeks ago
- Finetune Stable Video Diffusion with Lora☆20Feb 3, 2024Updated 2 years ago
- Official implementation of "Self-Improving Video Generation"☆78Apr 25, 2025Updated 10 months ago
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆398Jan 19, 2025Updated last year
- Let's finetune video generation models!☆546Sep 15, 2025Updated 6 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆162Apr 7, 2024Updated last year
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆168Dec 11, 2025Updated 3 months ago
- This respository contains the code for the NeurIPS 2024 paper SF-V: Single Forward Video Generation Model.☆99Nov 27, 2024Updated last year
- ☆132Jun 24, 2025Updated 8 months ago
- ☆30Mar 4, 2025Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆66Oct 16, 2024Updated last year
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆184Sep 26, 2024Updated last year
- Blending Custom Photos with Video Diffusion Transformers☆48Jan 21, 2025Updated last year
- [CVPR 2026] A training-free, mask-free framework for 3D shape editing.☆26Dec 12, 2025Updated 3 months ago
- Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"☆20Sep 16, 2025Updated 6 months ago
- A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration☆17Jul 22, 2022Updated 3 years ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆25Nov 29, 2025Updated 3 months ago
- Masters Degree☆13Nov 14, 2022Updated 3 years ago
- Scripts for running Ailiverse APIs☆10Jan 23, 2023Updated 3 years ago
- ☆52Dec 20, 2024Updated last year
- A parallelism VAE avoids OOM for high resolution image generation☆89Mar 12, 2026Updated last week
- VideoSys: An easy and efficient system for video generation☆2,020Aug 27, 2025Updated 6 months ago
- ☆11Mar 22, 2024Updated 2 years ago