DataArcTech / SQL-R1Links
[arXiv'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"
β85Updated 3 weeks ago
Alternatives and similar repositories for SQL-R1
Users that are interested in SQL-R1 are comparing it to the libraries listed below
Sorting:
- β318Updated 2 months ago
- π§Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learningβ243Updated 2 weeks ago
- Awesome Agent Trainingβ215Updated 3 weeks ago
- β361Updated 2 weeks ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuningβ148Updated 8 months ago
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyondβ286Updated 2 weeks ago
- An Awesome List of Agentic Model trained with Reinforcement Learningβ420Updated this week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learningβ767Updated last month
- β274Updated 3 months ago
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.β164Updated last month
- β147Updated 3 months ago
- [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Modelsβ580Updated this week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learningβ248Updated 3 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learningβ625Updated 3 weeks ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β244Updated 2 weeks ago
- A live reading list for LLM data synthesis (Updated to July, 2025).β366Updated this week
- β261Updated last month
- β405Updated last month
- A series of technical report on Slow Thinking with LLMβ726Updated 2 weeks ago
- A Survey on Multimodal Retrieval-Augmented Generationβ319Updated last week
- Survey on LLM Agents (Published on CoLing 2025)β377Updated 3 months ago
- Latest Advances on Long Chain-of-Thought Reasoningβ486Updated last month
- β67Updated 2 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.β150Updated last month
- β¨ Agentic Reinforced Policy Optimizationβ547Updated 2 weeks ago
- β108Updated 3 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!β69Updated 4 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineeringβ204Updated 4 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"β285Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.β568Updated 4 months ago