[ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache
☆61 · Jan 26, 2026 · Updated 3 months ago
Alternatives and similar repositories for DiCache
Users that are interested in DiCache are comparing it to the libraries listed below.
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC" ☆41 · Nov 26, 2025 · Updated 5 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning ☆36 · Aug 28, 2025 · Updated 8 months ago
- (ICLR 2026) Official repository of "ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing" ☆60 · Jan 26, 2026 · Updated 3 months ago
- The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache" ☆269 · Nov 17, 2025 · Updated 6 months ago
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence" ☆41 · Apr 19, 2026 · Updated last month
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought" ☆204 · Apr 13, 2026 · Updated last month
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance ☆86 · Sep 18, 2025 · Updated 8 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image" ☆10 · Dec 9, 2023 · Updated 2 years ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data. ☆34 · Apr 6, 2026 · Updated last month
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following ☆122 · Feb 13, 2026 · Updated 3 months ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models" ☆166 · Feb 16, 2026 · Updated 3 months ago
- RelightVid: Temporal-Consistent Diffusion Model for Video Relighting ☆115 · Apr 2, 2025 · Updated last year
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++ ☆221 · Apr 15, 2026 · Updated last month
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis ☆119 · Nov 3, 2025 · Updated 6 months ago
- ☆120 · Feb 10, 2026 · Updated 3 months ago
- High performance inference engine for diffusion models ☆108 · Sep 5, 2025 · Updated 8 months ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments ☆14 · Oct 25, 2023 · Updated 2 years ago
- Quantized Attention on GPU ☆44 · Nov 22, 2024 · Updated last year
- ☆26 · Aug 12, 2025 · Updated 9 months ago
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill… ☆90 · Jun 26, 2025 · Updated 10 months ago
- Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations" ☆97 · Jul 9, 2025 · Updated 10 months ago
- [NeurIPS 2023] Formulating Discrete Probability Flow Through Optimal Transport ☆21 · Jan 8, 2024 · Updated 2 years ago
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20. ☆24 · Jul 18, 2025 · Updated 10 months ago
- (WIP) Parallel inference for black-forest-labs' FLUX model. ☆19 · Nov 18, 2024 · Updated last year
- Unified layout planning and image generation, ICCV2025 ☆45 · Jan 19, 2026 · Updated 4 months ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning ☆268 · Feb 10, 2026 · Updated 3 months ago
- A PyTorch-native inference engine with cache, parallelism, quantization for Diffusion Transformers. ☆1,174 · May 12, 2026 · Updated last week
- [CVPR 2025] Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction ☆174 · Mar 23, 2025 · Updated last year
- Multiple GEMM operators are constructed with cutlass to support LLM inference. ☆20 · Aug 3, 2025 · Updated 9 months ago
- ☆12 · Sep 11, 2023 · Updated 2 years ago
- CVPR 2026 Highlight: Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers ☆78 · Apr 9, 2026 · Updated last month
- Official repository of MMDU dataset ☆105 · Sep 29, 2024 · Updated last year
- [ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning" ☆211 · Apr 17, 2026 · Updated last month
- ☆19 · Updated this week
- [CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers ☆77 · Sep 3, 2024 · Updated last year
- [WIP] Better (FP8) attention for Hopper ☆33 · Feb 24, 2025 · Updated last year
- Extending context length of visual language models ☆12 · Dec 18, 2024 · Updated last year
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× … ☆109 · Sep 8, 2025 · Updated 8 months ago
- Official implementation of the benchmarked 2D, 3D classification, and 3D semantic segmentation models on PeRFception. ☆14 · Jan 21, 2023 · Updated 3 years ago