MasterZhou1 / ReconLinks
Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs"
☆12Updated 5 months ago
Alternatives and similar repositories for Recon
Users that are interested in Recon are comparing it to the libraries listed below
Sorting:
- Official Repo for RuleReasoner.☆28Updated 5 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated last month
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 3 months ago
- A holistic benchmark for LLM abstention☆61Updated 3 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆34Updated 2 months ago
- ⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆46Updated 6 months ago
- ☆70Updated last month
- ☆18Updated 5 months ago
- ☆42Updated 5 months ago
- C^3-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking☆35Updated 5 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆36Updated 2 weeks ago
- SSRL: Self-Search Reinforcement Learning☆157Updated 3 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆38Updated 3 weeks ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆180Updated 4 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆187Updated 5 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆115Updated 5 months ago
- ☆65Updated 5 months ago
- A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models☆24Updated last month
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆54Updated last month
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆87Updated 2 months ago
- ☆98Updated 4 months ago
- The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution☆154Updated this week
- ☆62Updated last month
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆42Updated 11 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆18Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆24Updated 9 months ago
- ☆67Updated 8 months ago
- ☆29Updated 3 weeks ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆53Updated last year
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆108Updated 2 weeks ago