dInfer: An Efficient Inference Framework for Diffusion Language Models
☆434Feb 11, 2026Updated last month
Alternatives and similar repositories for dInfer
Users who are interested in dInfer are comparing it to the libraries listed below
- Easy and Efficient dLLM Fine-Tuning☆235Mar 2, 2026Updated 2 weeks ago
- A lightweight Inference Engine built for block diffusion models☆42Dec 9, 2025Updated 3 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo!☆35Jun 23, 2025Updated 8 months ago
- ☆44Sep 8, 2025Updated 6 months ago
- d3LLM: Ultra-Fast Diffusion LLM 🚀☆110Mar 15, 2026Updated last week
- DeeperGEMM: crazy optimized version☆75May 5, 2025Updated 10 months ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆464Jan 28, 2026Updated last month
- LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.☆374Feb 12, 2026Updated last month
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆883Jan 28, 2026Updated last month
- diffusers with search engine☆12Jan 13, 2026Updated 2 months ago
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".☆871Mar 10, 2026Updated last week
- ☆18Feb 23, 2026Updated 3 weeks ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,682Nov 12, 2025Updated 4 months ago
- ☆55Jun 4, 2025Updated 9 months ago
- Model souping for LLMs☆72Nov 18, 2025Updated 4 months ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆123Dec 25, 2025Updated 2 months ago
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.☆24Jul 18, 2025Updated 8 months ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Wuhan University undergraduate thesis project code: design and implementation of a graph federated learning system☆14Jun 5, 2023Updated 2 years ago
- Guichan is a C++ GUI library designed for games.☆14Oct 22, 2025Updated 5 months ago
- ☆36Mar 7, 2025Updated last year
- Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools☆174Mar 12, 2026Updated last week
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆13Apr 3, 2025Updated 11 months ago
- ☆52May 19, 2025Updated 10 months ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- [NeurIPS 2025] Multipole Attention for Efficient Long Context Reasoning☆22Dec 5, 2025Updated 3 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆246Feb 3, 2026Updated last month
- Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)☆71Apr 25, 2025Updated 10 months ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks.☆14Oct 3, 2022Updated 3 years ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆31Jan 27, 2026Updated last month
- High-performance distributed data shuffling (all-to-all) library for MoE training and inference☆114Mar 7, 2026Updated 2 weeks ago
- A Secure Version of DATAVIEW using SGX techniques.☆10Jul 6, 2021Updated 4 years ago
- ☆40Jan 16, 2026Updated 2 months ago
- ☆15Dec 2, 2019Updated 6 years ago
- How to plot for papers, slides, demos, etc.☆10Apr 7, 2022Updated 3 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 10 months ago
- An open, public resource platform on informatics, programming, and computer science in the Azerbaijani language.☆45Mar 12, 2026Updated last week