[ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache
☆58Jan 26, 2026Updated last month
Alternatives and similar repositories for DiCache
Users that are interested in DiCache are comparing it to the libraries listed below
Sorting:
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆39Nov 26, 2025Updated 3 months ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆109Dec 31, 2025Updated 2 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 6 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆59Jan 26, 2026Updated last month
- The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"☆264Nov 17, 2025Updated 4 months ago
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"☆40Jan 17, 2026Updated 2 months ago
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆186Feb 4, 2026Updated last month
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆86Sep 18, 2025Updated 6 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆48Oct 10, 2025Updated 5 months ago
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆119Feb 13, 2026Updated last month
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data.☆33Jan 21, 2026Updated 2 months ago
- RelightVid: Temporal-Consistent Diffusion Model for Video Relighting☆109Apr 2, 2025Updated 11 months ago
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆27Jun 5, 2025Updated 9 months ago
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++☆219Feb 2, 2026Updated last month
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆117Nov 3, 2025Updated 4 months ago
- ☆114Feb 10, 2026Updated last month
- High performance inference engine for diffusion models☆107Sep 5, 2025Updated 6 months ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- ☆26Aug 12, 2025Updated 7 months ago
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…☆90Jun 26, 2025Updated 8 months ago
- Unified layout planning and image generation, ICCV2025☆41Jan 19, 2026Updated 2 months ago
- Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations"☆97Jul 9, 2025Updated 8 months ago
- [NeurIPS 2023] Formulating Discrete Probability Flow Through Optimal Transport☆21Jan 8, 2024Updated 2 years ago
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.☆24Jul 18, 2025Updated 8 months ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning☆247Feb 10, 2026Updated last month
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- A PyTorch-native inference engine with hybrid cache acceleration and massive parallelism for DiTs.☆1,102Updated this week
- [CVPR 2025]Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction☆168Mar 23, 2025Updated 11 months ago
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆19Aug 3, 2025Updated 7 months ago
- ☆12Sep 11, 2023Updated 2 years ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆20Mar 8, 2026Updated last week
- [ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"☆194Feb 8, 2026Updated last month
- [ACM MM2025] The official repository for the RealSyn dataset☆40Dec 14, 2025Updated 3 months ago
- ☆18Feb 4, 2026Updated last month
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× …☆103Sep 8, 2025Updated 6 months ago
- [WIP] Better (FP8) attention for Hopper☆32Feb 24, 2025Updated last year