eth-sri / constrained-diffusionLinks
Constrained Decoding of Diffusion LLMs with Context-Free Grammars.
☆39Updated last month
Alternatives and similar repositories for constrained-diffusion
Users that are interested in constrained-diffusion are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆149Updated 4 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Updated 4 months ago
- Official Repository of Native Parallel Reasoner☆100Updated 2 weeks ago
- ☆148Updated this week
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆161Updated last week
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆57Updated 6 months ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆177Updated 3 weeks ago
- ☆74Updated last year
- ☆110Updated 4 months ago
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…☆114Updated last month
- ☆134Updated last week
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆107Updated 8 months ago
- ☆76Updated last month
- ☆131Updated 9 months ago
- Universal Reasoning Model☆122Updated 3 weeks ago
- ☆44Updated 9 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Updated 9 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆128Updated last year
- ☆123Updated 11 months ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆138Updated last month
- A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.☆103Updated this week
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆69Updated last year
- ☆21Updated 8 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 11 months ago
- ☆45Updated 7 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆120Updated last week
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Updated 3 months ago
- This is the official implementation for paper "PENCIL: Long Thoughts with Short Memory".☆73Updated 8 months ago
- [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.☆55Updated 9 months ago
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆89Updated 3 months ago