d3LLM: Ultra-Fast Diffusion LLM π
β120Apr 25, 2026Updated last week
Alternatives and similar repositories for d3LLM
Users that are interested in d3LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β55Apr 14, 2026Updated 3 weeks ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scalingβ22Updated this week
- Efficient Long-context Language Model Training by Core Attention Disaggregationβ98Apr 7, 2026Updated 3 weeks ago
- Code for "BayesAdapter: Being Bayesian, Inexpensively and Robustly, via Bayeisan Fine-tuning"β32Jul 25, 2024Updated last year
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learningβ63Dec 18, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β12Mar 18, 2024Updated 2 years ago
- β43Feb 27, 2026Updated 2 months ago
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoningβ68Oct 31, 2025Updated 6 months ago
- β21Jun 9, 2025Updated 10 months ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".β56Dec 28, 2025Updated 4 months ago
- A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cachβ¦β61Oct 27, 2025Updated 6 months ago
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformersβ15Feb 7, 2025Updated last year
- Research work aimed at addressing the problem of modeling infinite-length contextβ48Dec 18, 2025Updated 4 months ago
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learningβ43Feb 22, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Release doc/tutorial/wheels for poseidon-tfβ10Jan 18, 2018Updated 8 years ago
- Flexible and Pluggable Serving Engine for Diffusion LLMsβ69Updated this week
- β17Jun 10, 2022Updated 3 years ago
- β52Aug 22, 2025Updated 8 months ago
- Official implement of paper "Revisiting Multimodal Positional Encoding in VisionβLanguage Models", ICLR 2026β79Mar 16, 2026Updated last month
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221β31Apr 22, 2025Updated last year
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.β498Jan 28, 2026Updated 3 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flexβ¦β26Apr 4, 2026Updated last month
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengtβ¦β97Dec 27, 2025Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β25Apr 23, 2026Updated last week
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.β34Mar 11, 2025Updated last year
- [ICML 2026] Jacobi Forcing: Fast and Accurate Diffusion-style Decodingβ124Feb 20, 2026Updated 2 months ago
- Accelerating MoE with IO and Tile-aware Optimizationsβ664Updated this week
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligenceβ107Updated this week
- β19Jun 21, 2021Updated 4 years ago
- [ACL '26] Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"β39Jan 11, 2026Updated 3 months ago
- Dynamic resources changes for multi-dimensional parallelism trainingβ31Aug 22, 2025Updated 8 months ago
- Pygloo provides Python bindings for Gloo.β22Jul 7, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Internal utility libraries for Pklβ16Apr 24, 2026Updated last week
- Defeating the Training-Inference Mismatch via FP16β192Nov 14, 2025Updated 5 months ago
- β53Jan 23, 2026Updated 3 months ago
- Asynchronous pipeline parallel optimizationβ21Feb 2, 2026Updated 3 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"β56Updated this week
- dInfer: An Efficient Inference Framework for Diffusion Language Modelsβ459Feb 11, 2026Updated 2 months ago
- β95Nov 17, 2025Updated 5 months ago