MingyuJ666 / Disentangling-Memory-and-ReasoningLinks

[ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.

☆79

Alternatives and similar repositories for Disentangling-Memory-and-Reasoning

Users that are interested in Disentangling-Memory-and-Reasoning are comparing it to the libraries listed below

Sorting:

RyanLiu112 / GenPRM
[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆85Updated last week
LightChen233 / reasoning-boundary
☆69Updated 4 months ago
THU-KEG / AdaptThink
☆163Updated last month
RUCKBReasoning / CoT-based-Synthesizer
Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'
☆31Updated 5 months ago
bobxwu / learning-from-rewards-llm-papers
A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…
☆58Updated 5 months ago
lichengliu03 / unary-feedback
☆38Updated 3 months ago
USTC-StarTeam / ZIP
☆25Updated last year
RM-R1-UIUC / RM-R1
RM-R1: Unleashing the Reasoning Potential of Reward Models
☆148Updated 4 months ago
cs-holder / Reasoning-Self-Evolution-Survey
☆51Updated 8 months ago
MozerWang / AMPO
[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
☆46Updated 4 months ago
zjunlp / LightThinker
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆121Updated 7 months ago
WooooDyy / MathCritique
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
☆56Updated 11 months ago
GeniusHTX / TALE
☆135Updated 2 months ago
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆83Updated 7 months ago
ReasoningTransfer / Transferability-of-LLM-Reasoning
☆104Updated last month
RUC-NLPIR / Tool-Star
🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
☆286Updated 3 weeks ago
WangHanLinHenry / SPA-RL-Agent
Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"
☆48Updated 2 months ago
maple-research-lab / SLOT
☆110Updated 5 months ago
cmu-l3 / l1
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
☆258Updated 6 months ago
RUCAIBox / R1-Searcher-plus
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
☆65Updated 5 months ago
multimodal-art-projection / REER_DeepWriter
REverse-Engineered Reasoning for Open-Ended Generation
☆79Updated 2 months ago
qhjqhj00 / awesome-agentic-search
🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…
☆47Updated 2 months ago
ADaM-BJTU / OpenRFT
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
☆152Updated 10 months ago
IAAR-Shanghai / xVerify
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
☆138Updated 6 months ago
Dereck0602 / Awesome_Test_Time_LLMs
☆131Updated 8 months ago
hkgc-1 / GHPO
☆52Updated 3 months ago
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆104Updated 11 months ago
yafuly / TPO
Test-time preferenece optimization (ICML 2025).
☆169Updated 6 months ago
dongguanting / FollowRAG
The demo, code and data of FollowRAG
☆75Updated 4 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆87Updated 9 months ago