(ToCa-v2) A New version of ToCa,with faster speed and better acceleration!
☆41Mar 13, 2025Updated last year
Alternatives and similar repositories for DuCa
Users that are interested in DuCa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching☆216Mar 14, 2025Updated last year
- (ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆62Sep 17, 2025Updated 7 months ago
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆55Jul 8, 2024Updated last year
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆51Jun 6, 2025Updated 10 months ago
- ☆60Jan 29, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An official repository for GPTailor☆17Jun 29, 2025Updated 10 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Jul 24, 2024Updated last year
- 📚 Collection of token-level model compression resources.☆195Sep 3, 2025Updated 8 months ago
- ☆15Feb 21, 2024Updated 2 years ago
- ☆20Jun 10, 2025Updated 10 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model☆1,317Jun 8, 2025Updated 10 months ago
- The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"☆267Nov 17, 2025Updated 5 months ago
- ☆45Mar 15, 2024Updated 2 years ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆261Dec 27, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository contains Matlab codes associated with the paper entitled "Kinematic Modeling and Trajectory Tracking Control of an Octopu…☆11Apr 19, 2021Updated 5 years ago
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 10 months ago
- ☆33May 26, 2024Updated last year
- Chain of Images for Intuitively Reasoning☆10Nov 29, 2023Updated 2 years ago
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆426Jul 5, 2025Updated 10 months ago
- ☆24May 21, 2025Updated 11 months ago
- [NeurIPS 2024] The official repository of "Distribution-Aware Data Expansion with Diffusion Models".☆17Dec 15, 2025Updated 4 months ago
- [NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"☆352Mar 16, 2025Updated last year
- An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var…☆152Apr 10, 2026Updated 3 weeks ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆22Dec 8, 2024Updated last year
- Make Your Training Flexible: Towards Deployment-Efficient Video Models☆39Jun 11, 2025Updated 10 months ago
- ☆14Mar 23, 2025Updated last year
- [ACL 2025] Can MLLMs Understand the Deep Implication Behind Chinese Images?☆23Apr 9, 2026Updated 3 weeks ago
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…☆28Jan 21, 2025Updated last year
- SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging☆15Oct 24, 2024Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆21Feb 14, 2025Updated last year
- The code repository of UniRL☆52May 30, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The code of “DreamFuse: Adaptive Image Fusion with Diffusion Transformer”.☆27Jul 25, 2025Updated 9 months ago
- [Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling☆71Updated this week
- ☆95Feb 14, 2026Updated 2 months ago
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆165Dec 1, 2025Updated 5 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 6 months ago
- ☆13Apr 18, 2024Updated 2 years ago
- ☆19Oct 12, 2023Updated 2 years ago