Chengsong-Huang / Self-Calibration

Code for "Efficient Test-Time Scaling via Self-Calibration"

☆18

Alternatives and similar repositories for Self-Calibration

Users who are interested in Self-Calibration are comparing it to the repositories listed below.

NuoJohnChen / JudgeLRM
JudgeLRM: Large Reasoning Models as a Judge
☆40 Updated 2 months ago
Dereck0602 / Awesome_Test_Time_LLMs
☆131 Updated 8 months ago
zjunlp / LightThinker
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆123 Updated 7 months ago
MingyuJ666 / Disentangling-Memory-and-Reasoning
[ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.
☆79 Updated 3 weeks ago
EIT-NLP / Distilling-CoT-Reasoning
[ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".
☆19 Updated 9 months ago
LightChen233 / reasoning-boundary
☆69 Updated 5 months ago
StarDewXXX / AdaR1
The official repository of the NeurIPS'25 paper "Ada-R1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆20 Updated 3 weeks ago
luka-group / vlm-knowledge-conflict
Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."
☆48 Updated last year
multimodal-art-projection / TreePO
☆50 Updated last month
MingLiiii / Layer_Gradient
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆74 Updated 5 months ago
yubol-bobo / Awesome-Multi-Turn-LLMs
This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …
☆143 Updated 6 months ago
Raibows / CREAM
Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.
☆27 Updated 9 months ago
ReasoningTransfer / Transferability-of-LLM-Reasoning
☆104 Updated last month
yafuly / TPO
Test-time preference optimization (ICML 2025).
☆170 Updated 6 months ago
ZJU-REAL / EasySteer
A Unified Framework for High-Performance and Extensible LLM Steering
☆131 Updated last week
zjunlp / unlearn
[ACL 2025] Knowledge Unlearning for Large Language Models
☆46 Updated 2 months ago
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆83 Updated 8 months ago
lichengliu03 / unary-feedback
☆38 Updated 3 months ago
StarDewXXX / O1-Pruner
Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
☆97 Updated 9 months ago
mathllm / MATH-V
[NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.
☆123 Updated 6 months ago
sunnweiwei / FoldAgent
☆63 Updated last month
sail-sg / CPO
[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
☆132 Updated 8 months ago
SeanLeng1 / Reward-Calibration
☆20 Updated 11 months ago
SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆73 Updated last year
hkust-nlp / GUIMid
☆21 Updated 6 months ago
yuelinan / Awesome-Efficient-R1-style-LRMs
☆45 Updated 3 months ago
Evanwu1125 / LiteCoT
☆15 Updated 5 months ago
yayayacc / MUR
☆46 Updated last month
aeroplanepaper / GRPO-LEAD
☆30 Updated last week
dongxiangjue / Awesome-LLM-Self-Improvement
A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …
☆97 Updated 11 months ago