Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" (ICML 2025)
☆82May 22, 2025Updated 9 months ago
Alternatives and similar repositories for CausalCache-VDM
Users that are interested in CausalCache-VDM are comparing it to the libraries listed below
Sorting:
- ☆43Nov 22, 2023Updated 2 years ago
- ☆17Jul 30, 2024Updated last year
- ☆213Feb 11, 2025Updated last year
- This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…☆16Jul 19, 2024Updated last year
- Official Github Repo for Neurips 2024 Paper Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment☆63Jun 7, 2025Updated 8 months ago
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆52Jul 8, 2024Updated last year
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance☆41Feb 19, 2025Updated last year
- The official implementation of "2025ICLR Dynamic Diffusion Transformer" and "2025ArXivDyDiT++: Dynamic Diffusion Transformers for Efficie…☆47Apr 10, 2025Updated 10 months ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆630Jul 1, 2025Updated 8 months ago
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models☆295May 17, 2025Updated 9 months ago
- ☆31Sep 1, 2025Updated 6 months ago
- LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement☆75Jul 29, 2024Updated last year
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆48Dec 14, 2024Updated last year
- [SIGGRAPH Asia 2024] TrailBlazer: Trajectory Control for Diffusion-Based Video Generation☆100May 31, 2024Updated last year
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆629Oct 29, 2025Updated 4 months ago
- ☆30Nov 7, 2023Updated 2 years ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated 11 months ago
- ☆15Sep 23, 2024Updated last year
- Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting☆14Dec 19, 2025Updated 2 months ago
- ☆11Mar 3, 2025Updated last year
- Responsible Visual Editing☆15Jul 10, 2024Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆309Mar 12, 2025Updated 11 months ago
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆86Jan 9, 2025Updated last year
- [NeurIPS 2025] Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation☆34Oct 24, 2025Updated 4 months ago
- ☆92Jul 11, 2025Updated 7 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆97Mar 12, 2025Updated 11 months ago
- ☆28Sep 4, 2025Updated 6 months ago
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆17Sep 3, 2025Updated 6 months ago
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation☆31Apr 3, 2024Updated last year
- Real Time High-Fidelity Faceswap☆14Aug 21, 2024Updated last year
- [NeurIPS D&B Track 2024] Official implementation of HumanVid☆346Oct 14, 2025Updated 4 months ago
- ☆415Mar 10, 2025Updated 11 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆674Oct 25, 2024Updated last year
- Official Implementation of VideoDPO☆160Jun 1, 2025Updated 9 months ago
- This repo contains the code for 1D tokenizer and generator☆1,117Mar 20, 2025Updated 11 months ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆31Mar 11, 2025Updated 11 months ago
- Code of RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images☆93Nov 9, 2024Updated last year