DataArcTech / SQL-R1Links
[NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"
โ117Updated last month
Alternatives and similar repositories for SQL-R1
Users that are interested in SQL-R1 are comparing it to the libraries listed below
Sorting:
- ๐งTool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learningโ300Updated 2 months ago
- โ404Updated 2 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuningโ154Updated last year
- ๅจverlไธๅreward็ๅฎๅถๅผๅโ140Updated 7 months ago
- โ480Updated 2 months ago
- The official code of ARPO & AEPOโ843Updated this week
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.โ176Updated 6 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agentsโ263Updated last month
- โ325Updated 7 months ago
- A Survey on Multimodal Retrieval-Augmented Generationโ454Updated 2 months ago
- A Collection of Papers about Memory for Language Agentsโ245Updated 3 weeks ago
- ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generationโ55Updated 2 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learningโ667Updated 5 months ago
- ๐ A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyondโ325Updated last week
- A live reading list for LLM data synthesis (Updated to July, 2025).โ434Updated 4 months ago
- Awesome List for Agentic RLโ691Updated last month
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'โ32Updated 7 months ago
- Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.โ66Updated 2 months ago
- โ70Updated 6 months ago
- โ59Updated 11 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generationโ142Updated 10 months ago
- Latest Advances on Long Chain-of-Thought Reasoningโ596Updated 5 months ago
- A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Explorationโ115Updated 5 months ago
- โ101Updated 2 months ago
- โ176Updated last month
- DeepRAG: Thinking to Retrieve Step by Step for Large Language Modelsโ32Updated 7 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineeringโ209Updated 8 months ago
- โ161Updated 11 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (โฆโ464Updated this week
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.โ63Updated 2 months ago