[VLDB' 25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.
☆443Sep 8, 2025Updated 7 months ago
Alternatives and similar repositories for OmniSQL
Users that are interested in OmniSQL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenSearch-SQL code☆167May 30, 2025Updated 10 months ago
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆137Nov 20, 2025Updated 5 months ago
- RSL-SQL: Robust Schema Linking in Text-to-SQL Generation☆161Sep 17, 2025Updated 7 months ago
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,420Updated this week
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆140Oct 2, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- XiYanSQL models for Text-to-SQL.☆151Sep 3, 2025Updated 7 months ago
- a semi-structure representation of database schema☆217Jan 23, 2026Updated 3 months ago
- LLM Prompting for Text2SQL via Gradual SQL Reffnement☆15Feb 19, 2025Updated last year
- CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning☆63Aug 12, 2025Updated 8 months ago
- The source code of CodeS (SIGMOD 2024).☆197Nov 20, 2024Updated last year
- [ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows☆793Jan 30, 2026Updated 2 months ago
- Contextual Harnessing for Efficient SQL Synthesis☆268May 26, 2025Updated 11 months ago
- The source code for the schema filter (question + schema only)☆47May 13, 2024Updated last year
- A MULTI-GENERATOR ENSEMBLE FRAMEWORK FOR NATURAL LANGUAGE TO SQL☆994Feb 11, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SLM-SQL: An Exploration of Small Language Models for Text-to-SQL☆32Aug 12, 2025Updated 8 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆67Mar 6, 2026Updated last month
- ☆25May 26, 2025Updated 11 months ago
- A efficient and effective few-shot NL2SQL method on GPT-4.☆629Mar 7, 2025Updated last year
- 2024金融行业大模型挑战赛-人生海海团队方案☆24May 31, 2025Updated 10 months ago
- Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.☆3,603Jan 26, 2026Updated 3 months ago
- 𝔸𝕄𝔹ℝ𝕆𝕊𝕀𝔸: A Benchmark for Parsing Ambiguous Questions into Database Queries☆15Oct 31, 2024Updated last year
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆33Sep 22, 2025Updated 7 months ago
- 🦫 BEAVER: An Enterprise Benchmark for Text-to-SQL☆35May 23, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆159Nov 6, 2025Updated 5 months ago
- A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in T…☆1,978Jul 2, 2025Updated 9 months ago
- ☆61Nov 18, 2024Updated last year
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆157Jan 7, 2026Updated 3 months ago
- [ICLR/AAAI 2026] Open-Source LLM-Based Data Analysis Agents☆82Jan 26, 2026Updated 3 months ago
- End-to-End Local-First Text-to-SQL Pipelines☆452Feb 14, 2025Updated last year
- This repository contains all the code for the DTS-SQL paper☆55Jul 29, 2024Updated last year
- ☆13Jan 31, 2025Updated last year
- Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)☆134Aug 25, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ICDE 2024 Paper, MetaSQL: A Generate-then-Rank Framework for Natural Language to SQL Translation☆27May 9, 2025Updated 11 months ago
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆334Feb 27, 2025Updated last year
- ☆52Dec 7, 2024Updated last year
- The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …☆22Sep 18, 2025Updated 7 months ago
- ☆122Apr 4, 2026Updated 3 weeks ago
- The Pytorch implementation of RESDSQL (AAAI 2023).☆279May 13, 2024Updated last year
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆282Updated this week