hao-ai-lab/JacobiForcing

[ICML 2026] Jacobi Forcing: Fast and Accurate Diffusion-style Decoding

☆124

Alternatives and similar repositories for JacobiForcing

Users who are interested in JacobiForcing are comparing it to the repositories listed below.

SJTU-DENG-Lab / LoPA
LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding
☆39 · Apr 25, 2026 · Updated 3 months ago
hao-ai-lab / d3LLM
[ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀
☆147 · May 1, 2026 · Updated 2 months ago
SJTU-DENG-Lab / LightningRL
LightningRL: Breaking the Accuracy–Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
☆30 · Apr 25, 2026 · Updated 3 months ago
SJTU-DENG-Lab / UniCMs
☆39 · May 20, 2025 · Updated last year
SJTU-DENG-Lab / Diffulex
Flexible and Pluggable Serving Engine for Diffusion LLMs
☆147 · Jul 13, 2026 · Updated last week
SJTU-DENG-Lab / AdaMoE
[Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models
☆20 · Oct 2, 2024 · Updated last year
hao-ai-lab / DistCA
Efficient Long-context Language Model Training by Core Attention Disaggregation
☆106 · Apr 7, 2026 · Updated 3 months ago
SJTU-DENG-Lab / SIFT
SIFT: Grounding LLM Reasoning in Contexts via Stickers
☆57 · Mar 6, 2025 · Updated last year
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆261 · Feb 3, 2026 · Updated 5 months ago
SJTU-DENG-Lab / LatentUM
☆56 · Apr 9, 2026 · Updated 3 months ago
SJTU-DENG-Lab / Mantis
[CVPR 2026] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
☆92 · Jun 5, 2026 · Updated last month
SJTU-DENG-Lab / Orthogonal-Neural-operator
Code for orthogonal neural operator
☆17 · Oct 15, 2023 · Updated 2 years ago
SJTU-DENG-Lab / Orthus
☆89 · May 15, 2025 · Updated last year
lmgame-org / GRL
Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning
☆65 · Dec 18, 2025 · Updated 7 months ago
mit-han-lab / lpd
[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆104 · May 8, 2026 · Updated 2 months ago
hao-ai-lab / flash-attention-fp4
NVFP4 Flash-Attention 4 on BlackWell
☆30 · Updated this week
hao-ai-lab / JetSpec
JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Causal Parallel Tree Drafting
☆166 · Jun 27, 2026 · Updated 3 weeks ago
SJTU-DENG-Lab / WLA
The official implementation of World-Language-Action Model for Unified World Modeling, Language Reasoning, and Action Synthesis
☆124 · Jun 18, 2026 · Updated last month
Dogacel / Attention-Drift
Code for the paper *Attention Drift: What Speculative Decoding Models Learn*.
☆27 · May 12, 2026 · Updated 2 months ago
hao-ai-lab / LookaheadReasoning
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
☆69 · Oct 31, 2025 · Updated 8 months ago
czg1225 / dParallel
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆65 · Apr 12, 2026 · Updated 3 months ago
shengshu-ai / TurboServe
TurboServe: Serving Streaming Video Generation Efficiently and Economically
☆37 · Jul 12, 2026 · Updated last week
open-compass / RePro
[ICLR 2026] Rectifying LLM Thought From Lens of Optimization
☆15 · Dec 5, 2025 · Updated 7 months ago
hao-ai-lab / Awesome-Video-Attention
A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cach…
☆61 · Oct 27, 2025 · Updated 8 months ago
inclusionAI / dInfer
dInfer: An Efficient Inference Framework for Diffusion Language Models
☆475 · Feb 11, 2026 · Updated 5 months ago
SJTU-DENG-Lab / Think-Then-Generate
☆115 · Jul 1, 2026 · Updated 3 weeks ago
Introspective-Diffusion / I-DLM
☆151 · Apr 15, 2026 · Updated 3 months ago
JetAstra / SDAR
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model（1.7B, 4B, 8B, 30B）
☆362 · Jun 2, 2026 · Updated last month
VegaisMVP / AI-prediction-model-research
☆17 · Jun 19, 2025 · Updated last year
GregorAnderson / fukataf
Integration CodeIgniter and log4php
☆16 · Jul 15, 2025 · Updated last year
multimodal-art-projection / CriticLean
☆50 · Aug 5, 2025 · Updated 11 months ago
DiT-Serving / TetriServe
[ASPLOS' 26] TetriServe: Efficiently Serving Mixed DiT Workloads
☆17 · Mar 12, 2026 · Updated 4 months ago
hao-ai-lab / Dynasor
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.
☆232 · May 31, 2025 · Updated last year
sgl-project / SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
☆1,007 · Updated this week
OpenGVLab / SDLM
Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…
☆98 · Dec 27, 2025 · Updated 6 months ago
gybob / aai-protocol
AAI (Agent App Interface) is an open protocol that makes any app accessible to AI Agents. One aai.json descriptor makes your desktop or w…
☆117 · Apr 7, 2026 · Updated 3 months ago
kuleshov-group / bd3lms
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆1,023 · Jul 10, 2025 · Updated last year
Infini-AI-Lab / vortex_torch
Vortex: Programmable Sparse Attention for Agents as Algorithm Designers
☆67 · Jun 24, 2026 · Updated last month
pixeli99 / MixLN
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30 · Jul 24, 2025 · Updated last year