LAMDASZ-ML/Self-Backtracking

☆52

Alternatives and similar repositories for Self-Backtracking

Users who are interested in Self-Backtracking are comparing it to the libraries listed below.

QizhiPei / MathFusion
MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)
☆35 · Jul 16, 2025 · Updated 8 months ago
GeniusHTX / TALE
☆146 · Sep 12, 2025 · Updated 6 months ago
uservan / speculative_thinking
☆33 · Oct 13, 2025 · Updated 5 months ago
NineAbyss / S2R
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆73 · Apr 22, 2025 · Updated 10 months ago
wenlinyao / HDFlow
Code and data release of the paper "Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows"
☆14 · Oct 4, 2024 · Updated last year
Emo-gml / PsyLLM
Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling
☆29 · Jan 24, 2026 · Updated last month
Linzwcs / AFT
☆13 · Jan 22, 2025 · Updated last year
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆29 · Jun 4, 2024 · Updated last year
THU-KEG / PairJudgeRM
☆14 · Apr 14, 2025 · Updated 11 months ago
cs-holder / Reasoning-Self-Evolution-Survey
☆55 · Mar 6, 2025 · Updated last year
thu-coai / SPaR
☆46 · Jun 11, 2025 · Updated 9 months ago
Raibows / CREAM
Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.
☆29 · Feb 17, 2025 · Updated last year
SWE-EVO / SWE-EVO
☆35 · Jan 25, 2026 · Updated last month
byronBBL / Context-DPO
Official repository of the paper "Context-DPO: Aligning Language Models for Context-Faithfulness"
☆21 · Feb 17, 2025 · Updated last year
tval2 / contextual-pruning
Library to facilitate pruning of LLMs based on context
☆32 · Jan 31, 2024 · Updated 2 years ago
llong-cs / LaGAM
Code for CVPR 2024 paper: Positive-Unlabeled Learning by Latent Group-Aware Meta Disambiguation
☆21 · May 19, 2024 · Updated last year
ZhangXJ199 / EDGE-GRPO
Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
☆22 · Aug 28, 2025 · Updated 6 months ago
StarDewXXX / O1-Pruner
Official repository for the paper "O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning"
☆98 · Feb 21, 2025 · Updated last year
UCSB-NLP-Chang / ThinkPrune
☆46 · Sep 27, 2025 · Updated 5 months ago
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆32 · Aug 5, 2025 · Updated 7 months ago
beichenzbc / BoostStep
Official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆37 · Jan 21, 2025 · Updated last year
SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆73 · Nov 24, 2024 · Updated last year
LAMDASZ-ML / ChinaTravel
ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning
☆92 · Feb 13, 2026 · Updated last month
zeyofu / ReFocus_Code
Code for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]
☆46 · Jul 22, 2025 · Updated 8 months ago
ludybupt / FATRER
[ECAI 2023] Official implementation of "FATRER: Full-Attention Topic Regularizer for Accurate and Robust Conversational Emotion Recogniti…
☆13 · Oct 9, 2023 · Updated 2 years ago
kyegomez / EAOT
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆19 · Mar 11, 2024 · Updated 2 years ago
waterhorse1 / Natural-language-RL
Natural Language Reinforcement Learning
☆102 · Jul 30, 2025 · Updated 7 months ago
linkedin / ControlLLM
Control LLM
☆22 · Apr 6, 2025 · Updated 11 months ago
whunextgen / LLMindCraft
Shaping Language Models with Cognitive Insights
☆15 · Feb 29, 2024 · Updated 2 years ago
chanchimin / AgentMonitor
Code for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13 · Dec 13, 2024 · Updated last year
RUC-NLPIR / OmniGAIA
OmniGAIA: Towards Native Omni-Modal AI Agents
☆82 · Updated this week
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆125 · Jun 11, 2025 · Updated 9 months ago
yqh1988 / PageViewDemo
Multi-page swipe switching modeled on the Toutiao (今日头条) app
☆11 · Sep 7, 2018 · Updated 7 years ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆94 · Nov 18, 2025 · Updated 4 months ago
xiye17 / TextualExplInContext
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)
☆16 · Feb 11, 2023 · Updated 3 years ago
sauc-abadal / ALT
Official repository for ALT (ALignment with Textual feedback).
☆10 · Jul 25, 2024 · Updated last year
feiyang-k / AutoScale
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆13 · Aug 8, 2025 · Updated 7 months ago
aeroplanepaper / GRPO-LEAD
☆34 · Nov 18, 2025 · Updated 4 months ago
KOR-Bench / KOR-Bench
☆19 · Nov 12, 2024 · Updated last year