Jikai0Wang/Speculative_CoT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jikai0Wang/Speculative_CoT)

Jikai0Wang / Speculative_CoT

☆20

Alternatives and similar repositories for Speculative_CoT

Users that are interested in Speculative_CoT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BaohaoLiao / RSD
View on GitHub
[ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.
☆56May 2, 2025Updated last year
uservan / speculative_thinking
View on GitHub
☆34Oct 13, 2025Updated 9 months ago
Jikai0Wang / OPT-Tree
View on GitHub
☆30May 24, 2025Updated last year
ruipeterpan / specreason
View on GitHub
PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]
☆75Oct 2, 2025Updated 9 months ago
THU-KEG / AtomR
View on GitHub
[KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
☆15May 27, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
liuchengwucn / Safe
View on GitHub
(ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…
☆21Dec 26, 2025Updated 7 months ago
StarDewXXX / AdaR1
View on GitHub
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24May 6, 2026Updated 2 months ago
kevinscaria / TarGEN
View on GitHub
Targeted Data Generation with Large Language Models
☆19Jun 25, 2024Updated 2 years ago
AI9Stars / AStar-Thought
View on GitHub
[NeurIPS 2025] A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings
☆16Jun 12, 2026Updated last month
abdelfattah-lab / SplitReason
View on GitHub
☆20Mar 18, 2026Updated 4 months ago
StarDewXXX / Awesome-Hybrid-CoT-Reasoning
View on GitHub
☆62Jun 7, 2025Updated last year
subingangadharan / cmu15418
View on GitHub
My solution code to parallel architecture and programming Spring 2016
☆12Aug 15, 2016Updated 9 years ago
xiezheng-cs / DTQ
View on GitHub
PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)
☆18Jun 22, 2022Updated 4 years ago
ASTRAL-Group / LoRe
View on GitHub
When Reasoning Meets Its Laws
☆38Jan 2, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
garipovroma / autojudge
View on GitHub
[NeurIPS 2025] Official PyTorch implementation for the paper AutoJudge: Judge Decoding Without Manual Annotation
☆21Dec 22, 2025Updated 7 months ago
LMCache / lmcache-agent-trace
View on GitHub
Agent application/benchmark/workload traces should be placed here.
☆15Apr 13, 2026Updated 3 months ago
jokieleung / CL-VQA
View on GitHub
the implementation of EMNLP 2020 "Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering"
☆14Sep 9, 2021Updated 4 years ago
euiin / SMART
View on GitHub
SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…
☆12Jul 9, 2025Updated last year
vuhpdc / jellyfish
View on GitHub
Source code for Jellyfish, a soft real-time inference serving system
☆15Dec 20, 2022Updated 3 years ago
Xnhyacinth / NesyCD
View on GitHub
[AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks
☆12Jun 19, 2025Updated last year
wang8740 / MAP
View on GitHub
Documentation at
☆14Mar 27, 2025Updated last year
zhuhanqing / Lightening-Transformer-AE
View on GitHub
Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…
☆11Mar 3, 2024Updated 2 years ago
ASTRAL-Group / AlphaOne
View on GitHub
[EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
☆89Jun 10, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hao-ai-lab / LookaheadReasoning
View on GitHub
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
☆69Oct 31, 2025Updated 9 months ago
yuelinan / Awesome-Efficient-R1-style-LRMs
View on GitHub
☆53Jul 12, 2026Updated 2 weeks ago
7tl7qns7ch / IPOT
View on GitHub
Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs (AAAI 2024)
☆14Jul 30, 2024Updated 2 years ago
DS3Lab / Decentralized_FM_alpha
View on GitHub
☆18May 4, 2023Updated 3 years ago
zqOuO / GWT
View on GitHub
☆13May 4, 2026Updated 2 months ago
chrischia06 / neural-network-derivative-pricing
View on GitHub
Survey of neural network methods for derivatives pricing and risks
☆14Jul 5, 2022Updated 4 years ago
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
LedgeDash / unum
View on GitHub
☆12Oct 16, 2022Updated 3 years ago
ccvl / iep-ref
View on GitHub
Inferring and Executing Programs for Visual Reasoning
☆21Jan 4, 2019Updated 7 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
chuhac / Reasoning-to-Defend
View on GitHub
[EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
☆12Aug 22, 2025Updated 11 months ago
MaHuanAAA / logtoku
View on GitHub
☆42Aug 21, 2025Updated 11 months ago
AIFrameResearch / SPO
View on GitHub
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
☆55Sep 19, 2025Updated 10 months ago
Peiyannn / MM-PDE
View on GitHub
[ICLR24] Better Neural PDE Solvers Through Data-Free Mesh Movers
☆17Mar 20, 2024Updated 2 years ago
ArminAzizi98 / LaMDA
View on GitHub
☆15Nov 7, 2024Updated last year
GATECH-EIC / Auto-NBA
View on GitHub
[ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…
☆16Jan 3, 2022Updated 4 years ago
Cascol-Chen / COLA
View on GitHub
Code for NeurIPS 2024 paper — Cross-Device Collaborative Test-Time Adaptation
☆17Feb 28, 2025Updated last year