zhijie-group / SIFTLinks

SIFT: Grounding LLM Reasoning in Contexts via Stickers

☆57

Alternatives and similar repositories for SIFT

Users that are interested in SIFT are comparing it to the libraries listed below

Sorting:

UCSB-NLP-Chang / ThinkPrune
☆45Updated 3 months ago
sail-sg / AnytimeReasoner
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆50Updated 5 months ago
TIGER-AI-Lab / AceCoder
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆95Updated 8 months ago
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆86Updated 9 months ago
test-time-interaction / TTI
☆68Updated 6 months ago
Gen-Verse / CURE
[NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning
☆145Updated 3 months ago
hkust-nlp / B-STaR
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆86Updated 7 months ago
zjunlp / LightThinker
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆127Updated 8 months ago
Infini-AI-Lab / Multiverse
☆109Updated 3 months ago
efficientscaling / Z1
[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"
☆68Updated 8 months ago
LAMDASZ-ML / Self-Backtracking
☆50Updated 10 months ago
NuoJohnChen / JudgeLRM
JudgeLRM: Large Reasoning Models as a Judge
☆40Updated 3 weeks ago
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆120Updated 8 months ago
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆182Updated 5 months ago
Interplay-LM-Reasoning / Interplay-LM-Reasoning
☆110Updated 3 weeks ago
GeniusHTX / TALE
☆142Updated 3 months ago
StarDewXXX / O1-Pruner
Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
☆96Updated 10 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆88Updated 10 months ago
Infini-AI-Lab / S2FT
☆19Updated last year
sail-sg / feedback-conditional-policy
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆55Updated this week
NineAbyss / S2R
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆72Updated 8 months ago
THUDM / T1
RL Scaling and Test-Time Scaling (ICML'25)
☆112Updated 11 months ago
MiniMax-AI / SynLogic
[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆188Updated 6 months ago
SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆73Updated last year
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆70Updated 5 months ago
xufangzhi / phi-Decoding
[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling
☆105Updated 7 months ago
SalesforceAIResearch / GemFilter
☆85Updated last month
PRIME-RL / P1
P1: Mastering Physics Olympiads with Reinforcement Learning
☆69Updated last week
Trae1ounG / BuPO
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆47Updated last week
xufangzhi / Genius
[ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework
☆71Updated 7 months ago