facebookresearch/deepconf

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/deepconf)

facebookresearch / deepconf

DeepConf: Deep Think with Confidence

☆408

Alternatives and similar repositories for deepconf

Users that are interested in deepconf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hao-ai-lab / Dynasor
View on GitHub
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.
☆231May 31, 2025Updated last year
hao-ai-lab / LookaheadReasoning
View on GitHub
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
☆69Oct 31, 2025Updated 8 months ago
backprop07 / Self-Certainty
View on GitHub
Implementation of self-certainty as an extention of ZeroEval Project
☆38May 31, 2025Updated last year
ArminAzizi98 / ASC
View on GitHub
Activation-Steered Compression
☆17Jan 30, 2026Updated 5 months ago
TIGER-AI-Lab / Hierarchical-Reasoner
View on GitHub
Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]
☆64Apr 11, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Shenzhi-Wang / Beyond-the-80-20-Rule-RLVR
View on GitHub
The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…
☆60Jan 5, 2026Updated 6 months ago
PRIME-RL / TTRL
View on GitHub
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
☆1,100Apr 15, 2026Updated 3 months ago
StarDewXXX / O1-Pruner
View on GitHub
Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
☆99Feb 21, 2025Updated last year
XLearning-SCU / Reliable_TWI
View on GitHub
Pytorch Implementation of Reliable Thinking with Images.
☆26May 3, 2026Updated 2 months ago
Jikai0Wang / Speculative_CoT
View on GitHub
☆20May 14, 2025Updated last year
yuelinan / Awesome-Efficient-R1-style-LRMs
View on GitHub
☆53Jul 12, 2026Updated last week
aakaran / reasoning-with-sampling
View on GitHub
☆438Nov 7, 2025Updated 8 months ago
zxiangx / LC-R1
View on GitHub
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆29Oct 20, 2025Updated 9 months ago
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,587Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zjunlp / LightThinker
View on GitHub
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆165Jun 22, 2026Updated last month
uservan / speculative_thinking
View on GitHub
☆34Oct 13, 2025Updated 9 months ago
McGill-NLP / the-markovian-thinker
View on GitHub
Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
☆349Mar 16, 2026Updated 4 months ago
multimodal-art-projection / TreePO
View on GitHub
☆65Mar 30, 2026Updated 3 months ago
bethgelab / sober-reasoning
View on GitHub
A Sober Look at Language Model Reasoning
☆92Nov 18, 2025Updated 8 months ago
ruipeterpan / specreason
View on GitHub
PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]
☆74Oct 2, 2025Updated 9 months ago
PRIME-RL / Entropy-Mechanism-of-RL
View on GitHub
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆443Jul 11, 2025Updated last year
ltzheng / SimpleTIR
View on GitHub
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆401Mar 30, 2026Updated 3 months ago
THU-KEG / LRM-FactEval
View on GitHub
☆17Jun 25, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
GeniusHTX / TALE
View on GitHub
☆151Sep 12, 2025Updated 10 months ago
NVlabs / DLER
View on GitHub
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
☆17Nov 11, 2025Updated 8 months ago
AMAP-ML / Tree-GRPO
View on GitHub
[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning
☆386Jan 26, 2026Updated 5 months ago
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,708Updated this week
hemingkx / Awesome-Efficient-Reasoning
View on GitHub
Paper list for Efficient Reasoning.
☆900May 29, 2026Updated last month
justinwjl / GTB-DTI
View on GitHub
☆12Aug 11, 2025Updated 11 months ago
kxfan2002 / Reagent
View on GitHub
Agent-RRM: Exploring Reasoning Reward Model for Agents
☆70Mar 17, 2026Updated 4 months ago
lasgroup / SDPO
View on GitHub
Reinforcement Learning via Self-Distillation (SDPO)
☆1,017Jul 1, 2026Updated 2 weeks ago
BaohaoLiao / RSD
View on GitHub
[ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.
☆56May 2, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
chanchimin / AgentMonitor
View on GitHub
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Dec 13, 2024Updated last year
sail-sg / scaling-with-vocab
View on GitHub
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆112Sep 26, 2024Updated last year
fscdc / Awesome-Efficient-Reasoning-Models
View on GitHub
[TMLR 2025] Efficient Reasoning Models: A Survey
☆314Jun 26, 2026Updated 3 weeks ago
damanimehul / RLCR
View on GitHub
Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty
☆67Aug 20, 2025Updated 11 months ago
nusnlp / FSPO
View on GitHub
Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆26Oct 31, 2025Updated 8 months ago
hao-ai-lab / JetSpec
View on GitHub
JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Causal Parallel Tree Drafting
☆163Jun 27, 2026Updated 3 weeks ago
liushulinle / UloRL
View on GitHub
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 11 months ago