[ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀
☆127May 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for d3LLM
Users that are interested in d3LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆55Apr 14, 2026Updated last month
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆24May 14, 2026Updated last week
- Code for "BayesAdapter: Being Bayesian, Inexpensively and Robustly, via Bayeisan Fine-tuning"☆32Jul 25, 2024Updated last year
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆103Apr 7, 2026Updated last month
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆64Dec 18, 2025Updated 5 months ago
- ☆43Feb 27, 2026Updated 2 months ago
- ☆21Jun 9, 2025Updated 11 months ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆56Dec 28, 2025Updated 4 months ago
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning☆69Oct 31, 2025Updated 6 months ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆15Sep 18, 2020Updated 5 years ago
- A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cach…☆61Oct 27, 2025Updated 6 months ago
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers☆15Feb 7, 2025Updated last year
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆43Feb 22, 2026Updated 3 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆84Mar 9, 2026Updated 2 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆46Nov 6, 2025Updated 6 months ago
- Python package for P2 (Path Planning), a masked diffusion model sampling method for sequence generation (protein, text, etc.).☆23Aug 19, 2025Updated 9 months ago
- A repository to introduce the algorithmic information theory. You could learn what is Kolmogorov complexity and why it is important here.☆13Jul 23, 2025Updated 10 months ago
- Flexible and Pluggable Serving Engine for Diffusion LLMs☆69May 2, 2026Updated 3 weeks ago
- Release doc/tutorial/wheels for poseidon-tf☆10Jan 18, 2018Updated 8 years ago
- ☆53Aug 22, 2025Updated 9 months ago
- Official implement of paper "Revisiting Multimodal Positional Encoding in Vision–Language Models", ICLR 2026☆84May 4, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆87May 15, 2025Updated last year
- Code for orthogonal neural operator☆18Oct 15, 2023Updated 2 years ago
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated last year
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆42Jul 18, 2025Updated 10 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆506Jan 28, 2026Updated 3 months ago
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆97Dec 27, 2025Updated 4 months ago
- ☆26Apr 23, 2026Updated last month
- [ICML 2026] Jacobi Forcing: Fast and Accurate Diffusion-style Decoding☆114Feb 20, 2026Updated 3 months ago
- ☆19Jun 21, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Accelerating MoE with IO and Tile-aware Optimizations☆684May 14, 2026Updated last week
- Light Object-Relational Environment (LORE) provides a simple and lightweight pseudo-ORM/pseudo-struct-mapping environment for Go☆14Oct 21, 2017Updated 8 years ago
- [ACL '26] Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"☆39Jan 11, 2026Updated 4 months ago
- Dynamic resources changes for multi-dimensional parallelism training☆31Aug 22, 2025Updated 9 months ago
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence☆110May 11, 2026Updated 2 weeks ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆15Jan 16, 2026Updated 4 months ago
- Internal utility libraries for Pkl☆16May 14, 2026Updated last week