☆33Oct 13, 2025Updated 5 months ago
Alternatives and similar repositories for speculative_thinking
Users that are interested in speculative_thinking are comparing it to the libraries listed below
Sorting:
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]☆67Oct 2, 2025Updated 5 months ago
- ☆20May 14, 2025Updated 10 months ago
- ☆21Mar 5, 2026Updated 2 weeks ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆61Dec 26, 2025Updated 2 months ago
- ☆14Apr 14, 2025Updated 11 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Apr 11, 2025Updated 11 months ago
- ☆28May 24, 2025Updated 9 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆98Feb 21, 2025Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- ☆19Jan 3, 2025Updated last year
- [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.☆56May 2, 2025Updated 10 months ago
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆31Jul 6, 2025Updated 8 months ago
- Control LLM☆22Apr 6, 2025Updated 11 months ago
- ☆18Nov 20, 2024Updated last year
- Continuous Pipelined Speculative Decoding☆18Jan 4, 2026Updated 2 months ago
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling☆54Jul 15, 2025Updated 8 months ago
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆49Feb 2, 2026Updated last month
- minimal C implementation of speculative decoding based on llama2.c☆28Jul 15, 2024Updated last year
- ☆15Nov 7, 2024Updated last year
- Kinetics: Rethinking Test-Time Scaling Laws☆86Jul 11, 2025Updated 8 months ago
- ☆52Feb 12, 2025Updated last year
- ☆27May 30, 2025Updated 9 months ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆16Jun 28, 2024Updated last year
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆21Feb 17, 2025Updated last year
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- ☆13Jan 22, 2025Updated last year
- My personal site, using Wowchemy☆12Updated this week
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆58Nov 5, 2025Updated 4 months ago
- ☆146Sep 12, 2025Updated 6 months ago
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length☆148Dec 23, 2025Updated 2 months ago
- [TVCG & VR'25] LAPIG: Language Guided Projector Image Generation with Surface Adaptation and Stylization☆10Nov 9, 2025Updated 4 months ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆142Dec 17, 2025Updated 3 months ago
- ☆39May 20, 2025Updated 10 months ago
- Linux on RISC-V on FPGA (LOROF): RV64GC Sv39 Quad-Core Superscalar Out-of-Order Virtual Memory CPU☆15Feb 23, 2026Updated 3 weeks ago
- Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, l…☆29Mar 5, 2025Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 7 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Mar 6, 2025Updated last year
- ☆14Jun 24, 2024Updated last year
- ☆12Sep 1, 2023Updated 2 years ago