☆155 · Mar 30, 2026 · Updated last week
Alternatives and similar repositories for DiRL
Users who are interested in DiRL are comparing it to the libraries listed below.
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca… ☆46 · Nov 6, 2025 · Updated 5 months ago
- [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter ☆162 · Feb 27, 2026 · Updated last month
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt… ☆95 · Dec 27, 2025 · Updated 3 months ago
- Open diffusion language model for code generation, releasing pretraining, evaluation, inference, and checkpoints. ☆565 · Mar 1, 2026 · Updated last month
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i… ☆181 · Mar 6, 2026 · Updated last month
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains ☆87 · Mar 27, 2026 · Updated 2 weeks ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series. ☆486 · Jan 28, 2026 · Updated 2 months ago
- ☆92 · Nov 17, 2025 · Updated 4 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models ☆129 · May 22, 2025 · Updated 10 months ago
- Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models" ☆39 · Jan 11, 2026 · Updated 3 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model ☆117 · Jul 9, 2025 · Updated 9 months ago
- An agent for CUDA compute-communication kernel co-design ☆34 · Mar 24, 2026 · Updated 2 weeks ago
- ☆16 · Jul 23, 2024 · Updated last year
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B) ☆346 · Mar 16, 2026 · Updated 3 weeks ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache… ☆201 · Nov 17, 2025 · Updated 4 months ago
- [NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities ☆75 · Dec 21, 2025 · Updated 3 months ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models" ☆164 · Feb 16, 2026 · Updated last month
- ☆21 · Jul 25, 2025 · Updated 8 months ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding" ☆53 · Jun 16, 2025 · Updated 9 months ago
- Dream 7B, a large diffusion language model ☆1,211 · Nov 21, 2025 · Updated 4 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding" ☆915 · Updated this week
- ☆38 · Aug 7, 2025 · Updated 8 months ago
- Personalized knowledge graph summarization based on historical queries ☆14 · Jun 17, 2020 · Updated 5 years ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" ☆430 · Jan 26, 2026 · Updated 2 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models ☆385 · May 31, 2025 · Updated 10 months ago
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning ☆44 · Sep 3, 2025 · Updated 7 months ago
- ☆335 · Mar 23, 2026 · Updated 2 weeks ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆17 · Feb 9, 2026 · Updated 2 months ago
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max) ☆44 · Feb 10, 2026 · Updated 2 months ago
- Language Models for Code Completion: a Practical Evaluation ☆13 · Jan 19, 2024 · Updated 2 years ago
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation ☆94 · Mar 12, 2026 · Updated 3 weeks ago
- This code implements the algorithm of FIPO, a value-free RL recipe for eliciting deeper reasoning from a clean base model. ☆89 · Updated this week
- Accelerate LLM preference tuning via prefix sharing with a single line of code ☆51 · Jul 4, 2025 · Updated 9 months ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression ☆50 · Mar 13, 2026 · Updated 3 weeks ago
- ☆18 · Oct 17, 2024 · Updated last year
- 🧮 Algebraic Positional Encodings. ☆20 · Aug 20, 2025 · Updated 7 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation". ☆24 · Oct 22, 2025 · Updated 5 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models ☆116 · Sep 27, 2025 · Updated 6 months ago
- RISC-V-based many-core neuromorphic architecture ☆16 · Aug 3, 2025 · Updated 8 months ago