[VLDB' 25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.
☆437Sep 8, 2025Updated 6 months ago
Alternatives and similar repositories for OmniSQL
Users that are interested in OmniSQL are comparing it to the libraries listed below
Sorting:
- OpenSearch-SQL code☆167May 30, 2025Updated 9 months ago
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆131Nov 20, 2025Updated 3 months ago
- RSL-SQL: Robust Schema Linking in Text-to-SQL Generation☆158Sep 17, 2025Updated 6 months ago
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,361Mar 3, 2026Updated 2 weeks ago
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆140Oct 2, 2025Updated 5 months ago
- XiYanSQL models for Text-to-SQL.☆148Sep 3, 2025Updated 6 months ago
- CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning☆58Aug 12, 2025Updated 7 months ago
- LLM Prompting for Text2SQL via Gradual SQL Reffnement☆15Feb 19, 2025Updated last year
- The source code of CodeS (SIGMOD 2024).☆196Nov 20, 2024Updated last year
- a semi-structure representation of database schema☆216Jan 23, 2026Updated last month
- [ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows☆752Jan 30, 2026Updated last month
- Contextual Harnessing for Efficient SQL Synthesis☆266May 26, 2025Updated 9 months ago
- The source code for the schema filter (question + schema only)☆47May 13, 2024Updated last year
- A MULTI-GENERATOR ENSEMBLE FRAMEWORK FOR NATURAL LANGUAGE TO SQL☆979Feb 11, 2026Updated last month
- SLM-SQL: An Exploration of Small Language Models for Text-to-SQL☆31Aug 12, 2025Updated 7 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆67Mar 6, 2026Updated last week
- ☆26May 26, 2025Updated 9 months ago
- A efficient and effective few-shot NL2SQL method on GPT-4.☆625Mar 7, 2025Updated last year
- 2024金融行业大模型挑战赛-人生海海团队方案☆24May 31, 2025Updated 9 months ago
- Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.☆3,532Jan 26, 2026Updated last month
- 🦫 BEAVER: An Enterprise Benchmark for Text-to-SQL☆27May 23, 2025Updated 9 months ago
- [ICLR/AAAI 2026] Open-Source LLM-Based Data Analysis Agents☆73Jan 26, 2026Updated last month
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆32Sep 22, 2025Updated 5 months ago
- ☆148Nov 6, 2025Updated 4 months ago
- A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in T…☆1,967Jul 2, 2025Updated 8 months ago
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆151Jan 7, 2026Updated 2 months ago
- ☆61Nov 18, 2024Updated last year
- End-to-End Local-First Text-to-SQL Pipelines☆439Feb 14, 2025Updated last year
- 𝔸𝕄𝔹ℝ𝕆𝕊𝕀𝔸: A Benchmark for Parsing Ambiguous Questions into Database Queries☆14Oct 31, 2024Updated last year
- This repository contains all the code for the DTS-SQL paper☆54Jul 29, 2024Updated last year
- ☆13Jan 31, 2025Updated last year
- Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)☆134Aug 25, 2025Updated 6 months ago
- ICDE 2024 Paper, MetaSQL: A Generate-then-Rank Framework for Natural Language to SQL Translation☆26May 9, 2025Updated 10 months ago
- The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …☆21Sep 18, 2025Updated 6 months ago
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆330Feb 27, 2025Updated last year
- MAG-SQL: Multi-Agent Generative Approach with Soft Schema Linking and Iterative Sub-SQL Refinement for Text-to-SQL☆18Jul 10, 2025Updated 8 months ago
- ☆52Dec 7, 2024Updated last year
- The Pytorch implementation of RESDSQL (AAAI 2023).☆277May 13, 2024Updated last year
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆278Updated this week