dInfer: An Efficient Inference Framework for Diffusion Language Models
☆449Feb 11, 2026Updated 2 months ago
Alternatives and similar repositories for dInfer
Users that are interested in dInfer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Easy and Efficient dLLM Fine-Tuning☆239Mar 2, 2026Updated last month
- A lightweight Inference Engine built for block diffusion models☆43Dec 9, 2025Updated 4 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 6 months ago
- ☆46Sep 8, 2025Updated 7 months ago
- SGLang Kernel Wheel Index☆18Apr 3, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale☆25Jul 31, 2025Updated 8 months ago
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated 11 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆486Jan 28, 2026Updated 2 months ago
- The official repo for the code and data of paper SMART☆40Feb 20, 2025Updated last year
- LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.☆392Feb 12, 2026Updated last month
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆915Updated this week
- diffusers with search engine☆12Jan 13, 2026Updated 2 months ago
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".☆924Mar 10, 2026Updated last month
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,713Nov 12, 2025Updated 4 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Single-stage End-to-End Training for Tokenization and Generation☆81Mar 24, 2026Updated 2 weeks ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.☆24Jul 18, 2025Updated 8 months ago
- Model souping for LLMs☆73Nov 18, 2025Updated 4 months ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆11Jul 9, 2025Updated 9 months ago
- Python package for P2 (Path Planning), a masked diffusion model sampling method for sequence generation (protein, text, etc.).☆23Aug 19, 2025Updated 7 months ago
- Personalized knowledge graph summarization based on historical queries☆14Jun 17, 2020Updated 5 years ago
- Guichan is a C++ GUI library designed for games.☆14Oct 22, 2025Updated 5 months ago
- ☆36Mar 7, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools☆189Mar 12, 2026Updated 3 weeks ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- ☆52May 19, 2025Updated 10 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆247Feb 3, 2026Updated 2 months ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆31Jan 27, 2026Updated 2 months ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks.☆14Oct 3, 2022Updated 3 years ago
- dLLM: Simple Diffusion Language Modeling☆2,338Feb 27, 2026Updated last month
- High-performance distributed data shuffling (all-to-all) library for MoE training and inference☆117Mar 7, 2026Updated last month
- A Secure Version of DATAVIEW using SGX techniques.☆10Jul 6, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆15Dec 2, 2019Updated 6 years ago
- ☆40Jan 16, 2026Updated 2 months ago
- How to plot for papers, slides, demos, etc.☆10Apr 7, 2022Updated 4 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆16Apr 29, 2025Updated 11 months ago
- Azərbaycan dilində informatika, proqramlaşdırma və kompüter elmləri haqqında açıq və ictimai resurs platforması.☆45Mar 12, 2026Updated 3 weeks ago
- [NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive☆66Dec 11, 2025Updated 4 months ago
- A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.☆103Dec 17, 2025Updated 3 months ago