[ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache
☆55Jan 26, 2026Updated last month
Alternatives and similar repositories for DiCache
Users that are interested in DiCache are comparing it to the libraries listed below
Sorting:
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆37Nov 26, 2025Updated 3 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆108Dec 31, 2025Updated 2 months ago
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.☆24Jul 18, 2025Updated 7 months ago
- [NeurIPS 2025, Spotlight]: Ambient-o: Training Good models with Bad Data.☆30Jan 21, 2026Updated last month
- ☆110Feb 10, 2026Updated 2 weeks ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆58Jan 26, 2026Updated last month
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆115Nov 3, 2025Updated 3 months ago
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆20Aug 3, 2025Updated 6 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆47Oct 10, 2025Updated 4 months ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆85Sep 18, 2025Updated 5 months ago
- [NeurIPS 2023] Formulating Discrete Probability Flow Through Optimal Transport☆21Jan 8, 2024Updated 2 years ago
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"☆39Jan 17, 2026Updated last month
- ☆25Aug 12, 2025Updated 6 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 6 months ago
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆117Feb 13, 2026Updated 2 weeks ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- RelightVid: Temporal-Consistent Diffusion Model for Video Relighting☆102Apr 2, 2025Updated 10 months ago
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆112Updated this week
- The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"☆260Nov 17, 2025Updated 3 months ago
- ☆23Oct 15, 2024Updated last year
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆162Feb 16, 2026Updated 2 weeks ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++☆219Feb 2, 2026Updated 3 weeks ago
- Unified layout planning and image generation, ICCV2025☆40Jan 19, 2026Updated last month
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated last year
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆33Jun 30, 2025Updated 8 months ago
- 🤗 A PyTorch-native and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for DiTs.☆1,058Updated this week
- CVPR 2026-Guiding a Diffusion Transformer with the Internal Dynamics of Itself (IG)☆60Updated this week
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× …☆101Sep 8, 2025Updated 5 months ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning☆232Feb 10, 2026Updated 2 weeks ago
- [CVPR 2025]Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction☆168Mar 23, 2025Updated 11 months ago
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…☆90Jun 26, 2025Updated 8 months ago
- Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations"☆95Jul 9, 2025Updated 7 months ago
- tokviz is a Python library for visualizing tokenization patterns across different language models.☆12Apr 25, 2024Updated last year
- ☆17Feb 4, 2026Updated 3 weeks ago
- [ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"☆189Feb 8, 2026Updated 3 weeks ago