DataArcTech / SQL-R1
[NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"
☆124 · Updated 2 months ago
Alternatives and similar repositories for SQL-R1
Users who are interested in SQL-R1 are comparing it to the repositories listed below
- 🔧 Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆314 · Updated last month
- ☆427 · Updated 3 months ago
- ☆333 · Updated 8 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆155 · Updated last year
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆683 · Updated 6 months ago
- The official code of ARPO & AEPO ☆880 · Updated last week
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models. ☆180 · Updated 7 months ago
- ☆490 · Updated 3 months ago
- ☆178 · Updated 2 months ago
- ☆275 · Updated 5 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond ☆342 · Updated 2 weeks ago
- ☆70 · Updated 7 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents ☆298 · Updated this week
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the… ☆185 · Updated 7 months ago
- A Collection of Papers about Memory for Language Agents ☆310 · Updated 2 weeks ago
- Awesome List for Agentic RL ☆760 · Updated last month
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering ☆209 · Updated 9 months ago
- A comprehensive collection of process reward models. ☆135 · Updated 4 months ago
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework ☆205 · Updated 3 weeks ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆258 · Updated 5 months ago
- ☆104 · Updated 3 months ago
- A Survey on Multimodal Retrieval-Augmented Generation ☆468 · Updated 3 weeks ago
- Survey on LLM Agents (Published on CoLing 2025) ☆470 · Updated 4 months ago
- Custom reward development on verl ☆144 · Updated 8 months ago
- Latest Advances on Long Chain-of-Thought Reasoning ☆605 · Updated 6 months ago
- llm & rl ☆271 · Updated 3 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025). ☆449 · Updated 5 months ago
- EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization ☆57 · Updated 4 months ago
- a-m-team's exploration in large language modeling ☆195 · Updated 8 months ago
- Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization. ☆70 · Updated 3 months ago