MasterZhou1 / ReconLinks
Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs"
☆12Updated 7 months ago
Alternatives and similar repositories for Recon
Users that are interested in Recon are comparing it to the libraries listed below
Sorting:
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆30Updated last week
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 5 months ago
- A holistic benchmark for LLM abstention☆69Updated 5 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 4 months ago
- ☆18Updated 7 months ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆27Updated 3 months ago
- ☆46Updated 7 months ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆49Updated last week
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆37Updated 4 months ago
- C^3-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking☆36Updated 7 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆42Updated 2 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆26Updated 11 months ago
- SING: SDE Inference via Natural Gradients☆36Updated last month
- When Reasoning Meets Its Laws☆35Updated last month
- ☆17Updated 6 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆39Updated this week
- Resa: Transparent Reasoning Models via SAEs☆47Updated 4 months ago
- ☆71Updated 3 months ago
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆34Updated 2 weeks ago
- SSRL: Self-Search Reinforcement Learning☆206Updated 5 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆100Updated 6 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆115Updated last month
- BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution☆58Updated 3 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Updated 7 months ago
- A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models☆25Updated 3 months ago
- ☆29Updated 3 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆44Updated last year
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Updated last year
- the open-source code of QAgent☆52Updated 3 months ago