WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups over vLLM-optimized baselines.
☆643Mar 3, 2026Updated 3 months ago
Alternatives and similar repositories for WeDLM
Users that are interested in WeDLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of dLLM-Var☆35Nov 6, 2025Updated 7 months ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆42Jul 18, 2025Updated 11 months ago
- [ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀☆141May 1, 2026Updated last month
- Run Claude Code/Codex within AgentFS, orchestrated by LlamaIndex Workflows☆324Dec 19, 2025Updated 6 months ago
- ☆15Apr 14, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated 3 months ago
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆338Nov 11, 2025Updated 7 months ago
- All-in-One Safety Evaluation Framwork☆50Apr 21, 2026Updated last month
- [ICLR'26] Official code of paper "d2Cache: Accelerating Diffusion-based LLMs via Dual Adaptive Caching"☆131May 14, 2026Updated last month
- ☆48Oct 23, 2025Updated 7 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆1,041May 30, 2026Updated 2 weeks ago
- Official Codebase For paper "One-step Language Modeling via Continuous Denoising"☆143Jun 12, 2026Updated last week
- A Rust crate offering similar functionality to the Python transformers package using Candle.☆15Nov 19, 2024Updated last year
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆17May 30, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆206May 1, 2026Updated last month
- ✨ Iter-X is an AI-powered travel app that helps users create personalized itineraries, discover trending destinations, and share trip ide…☆16Apr 26, 2025Updated last year
- This repo holds the research projects of our lab.☆11Jan 20, 2024Updated 2 years ago
- A Structured Output Benchmark whose 'ground-truth' is actually right☆19Dec 5, 2025Updated 6 months ago
- ☆17Sep 1, 2024Updated last year
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆42Nov 19, 2025Updated 7 months ago
- Easy and Efficient dLLM Fine-Tuning☆259Mar 2, 2026Updated 3 months ago
- FlashKDA: high-performance Kimi Delta Attention kernels☆448May 26, 2026Updated 3 weeks ago
- Standardized environment infrastructure for Agentic AI development.☆307Jun 11, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆47Sep 15, 2025Updated 9 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆754Jun 6, 2025Updated last year
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆145Apr 27, 2026Updated last month
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…☆68Apr 8, 2026Updated 2 months ago
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆74May 13, 2026Updated last month
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆223Jun 26, 2025Updated 11 months ago
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆98Dec 27, 2025Updated 5 months ago
- Python library to add support for embedding natural code in Python with shared program state.☆30Jan 20, 2026Updated 4 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆133May 22, 2025Updated last year
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆142Aug 13, 2025Updated 10 months ago
- Распознавание рукописного текста в школьных тетрадях☆20Feb 15, 2023Updated 3 years ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆170Feb 16, 2026Updated 4 months ago
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆115Jan 14, 2026Updated 5 months ago
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels☆171Apr 26, 2026Updated last month