DataArcTech / SQL-R1
[NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"
☆115 Updated last month
Alternatives and similar repositories for SQL-R1
Users who are interested in SQL-R1 are comparing it to the libraries listed below.
- 🔧 Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆294 Updated last month
- ☆398 Updated 2 months ago
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models. ☆177 Updated 5 months ago
- ☆448 Updated 2 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆154 Updated 11 months ago
- ☆319 Updated 6 months ago
- A Survey on Multimodal Retrieval-Augmented Generation ☆443 Updated last month
- Custom reward development on verl ☆135 Updated 6 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents ☆248 Updated 3 weeks ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆666 Updated 4 months ago
- Awesome List for Agentic RL ☆632 Updated last week
- A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond ☆319 Updated 2 months ago
- A curated list of awesome works in the Routing LLMs paradigm (Welcome to submit your contributions to this code repository) ☆100 Updated 3 weeks ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis' ☆32 Updated 7 months ago
- The official code of ARPO & AEPO ☆829 Updated last month
- ☆98 Updated last month
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions. ☆52 Updated 4 months ago
- ☆59 Updated 11 months ago
- ☆249 Updated 4 months ago
- Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization. ☆63 Updated last month
- A Collection of Papers about Memory for Language Agents ☆207 Updated this week
- ☆173 Updated 2 weeks ago
- ☆69 Updated 6 months ago
- ☆40 Updated 9 months ago
- Latest Advances on Long Chain-of-Thought Reasoning ☆573 Updated 5 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering ☆209 Updated 7 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆254 Updated 4 months ago
- A research repo for experiments about Reinforcement Finetuning ☆53 Updated 8 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025). ☆424 Updated 3 months ago
- ☆292 Updated 5 months ago