[ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀
☆146May 1, 2026Updated 2 months ago
Alternatives and similar repositories for d3LLM
Users that are interested in d3LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆21Nov 3, 2025Updated 8 months ago
- ☆55Apr 14, 2026Updated 2 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆29Updated this week
- Code for "BayesAdapter: Being Bayesian, Inexpensively and Robustly, via Bayeisan Fine-tuning"☆32Jul 25, 2024Updated last year
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆64Dec 18, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Mar 18, 2024Updated 2 years ago
- ☆45Feb 27, 2026Updated 4 months ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆58Dec 28, 2025Updated 6 months ago
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning☆69Oct 31, 2025Updated 8 months ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆15Sep 18, 2020Updated 5 years ago
- ☆10May 26, 2020Updated 6 years ago
- A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cach…☆61Oct 27, 2025Updated 8 months ago
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers☆15Feb 7, 2025Updated last year
- Research work aimed at addressing the problem of modeling infinite-length context☆49Dec 18, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆45Feb 22, 2026Updated 4 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆84Mar 9, 2026Updated 3 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆46Nov 6, 2025Updated 7 months ago
- Python package for P2 (Path Planning), a masked diffusion model sampling method for sequence generation (protein, text, etc.).☆23Aug 19, 2025Updated 10 months ago
- A repository to introduce the algorithmic information theory. You could learn what is Kolmogorov complexity and why it is important here.☆13Jul 23, 2025Updated 11 months ago
- Flexible and Pluggable Serving Engine for Diffusion LLMs☆112Jun 26, 2026Updated last week
- Release doc/tutorial/wheels for poseidon-tf☆10Jan 18, 2018Updated 8 years ago
- ☆17Jun 10, 2022Updated 4 years ago
- Official implement of paper "Revisiting Multimodal Positional Encoding in Vision–Language Models", ICLR 2026☆88May 4, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for orthogonal neural operator☆17Oct 15, 2023Updated 2 years ago
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated last year
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆98Dec 27, 2025Updated 6 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆510Jan 28, 2026Updated 5 months ago
- ☆28Jun 25, 2026Updated last week
- ☆46Mar 17, 2026Updated 3 months ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- ☆19Jun 21, 2021Updated 5 years ago
- [ICML 2026] Jacobi Forcing: Fast and Accurate Diffusion-style Decoding☆118Feb 20, 2026Updated 4 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Accelerating MoE with IO and Tile-aware Optimizations☆720Jun 26, 2026Updated last week
- [ACL '26] Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"☆40Jun 26, 2026Updated last week
- Dynamic resources changes for multi-dimensional parallelism training☆31Aug 22, 2025Updated 10 months ago
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence☆117May 11, 2026Updated last month
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆16Jan 16, 2026Updated 5 months ago
- Internal utility libraries for Pkl☆17Jun 25, 2026Updated last week
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆104May 8, 2026Updated last month