Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation length and maintaining KV-cache compatibility, achieving high efficiency and throughput.
☆98Dec 27, 2025Updated 6 months ago
Alternatives and similar repositories for SDLM
Users that are interested in SDLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…☆64Nov 7, 2024Updated last year
- Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…☆28Mar 9, 2026Updated 3 months ago
- Build AI Agent using Google ADK , MCP and Gemma 3 model☆27Apr 23, 2025Updated last year
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆25Apr 6, 2026Updated 2 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆63Apr 12, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'☆45Updated this week
- ☆163Mar 30, 2026Updated 3 months ago
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm…☆46Apr 12, 2026Updated 2 months ago
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆151May 18, 2026Updated last month
- ☆42Jun 14, 2025Updated last year
- 这是我的博客《不用框架,使用Python搭建基于numpy的卷积神经网络来进行cifar-10分类的深度学习系统》的代码实现。☆10Jul 1, 2019Updated 7 years ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆63Nov 5, 2024Updated last year
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆40Jan 16, 2026Updated 5 months ago
- The official code repository for the FullFront benchmark☆27May 16, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official repository of "Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling"☆14Nov 26, 2025Updated 7 months ago
- [ICLR'26] Official code of paper "d2Cache: Accelerating Diffusion-based LLMs via Dual Adaptive Caching"☆135May 14, 2026Updated last month
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆117Jul 9, 2025Updated 11 months ago
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆43Sep 20, 2025Updated 9 months ago
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆31Sep 23, 2024Updated last year
- [EMNLP 2025] RouterLens☆29Sep 15, 2025Updated 9 months ago
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)☆15Aug 16, 2024Updated last year
- ☆19Aug 4, 2025Updated 10 months ago
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings☆39Sep 13, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for "Kuramoto Orientation Diffusion"☆31Nov 7, 2025Updated 7 months ago
- Generative Regional Editing (GRE) Benchmark☆20Sep 10, 2024Updated last year
- A Pytorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare To…☆10Apr 20, 2022Updated 4 years ago
- ☆14May 26, 2021Updated 5 years ago
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- It shows how to realize agentic RAG.☆30Jun 20, 2025Updated last year
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models. FS-DFM accepted for ICLR 2026☆45Jan 6, 2026Updated 5 months ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆62May 13, 2025Updated last year
- ☆15Oct 10, 2020Updated 5 years ago
- ☆25Dec 13, 2024Updated last year
- Official Codebase For paper "One-step Language Modeling via Continuous Denoising"☆145Jun 12, 2026Updated 2 weeks ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆207May 1, 2026Updated 2 months ago
- The code for paper "Diversifying Dialog Generation via Adaptive Label Smoothing" in ACL 2021.☆26Jun 7, 2021Updated 5 years ago
- BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing☆59Mar 11, 2024Updated 2 years ago