weAIDB / awesome-data-llmLinks
Official Repository of "LLM × DATA" Survey Paper
☆551Updated 2 weeks ago
Alternatives and similar repositories for awesome-data-llm
Users that are interested in awesome-data-llm are comparing it to the libraries listed below
Sorting:
- GPTuner is a manual-reading database tuning system leveraging domain knowlege automatically and extensively to enhance knob tuning proces…☆120Updated 4 months ago
- an unstructured data analytics systems via LLM☆20Updated 3 months ago
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆122Updated 3 weeks ago
- The source code of CodeS (SIGMOD 2024).☆193Updated last year
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆270Updated this week
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆137Updated last month
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,139Updated 2 weeks ago
- PilotScope is a middleware to bridge the gaps of deploying AI4DB (Artificial Intelligence for Databases) algorithms into actual database …☆165Updated last year
- PostgreSQL extension for supporting deep learning model inference within the database and vector storage☆57Updated last month
- Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".☆20Updated 3 months ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆26Updated last month
- This project provides a demo for text-to-SQL based on CodeS.☆57Updated last year
- ☆14Updated 8 months ago
- LLM-based Dialect Translation System☆73Updated last month
- ☆50Updated 11 months ago
- Contextual Harnessing for Efficient SQL Synthesis☆254Updated 5 months ago
- A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration☆105Updated 3 months ago
- The source code for the schema filter (question + schema only)☆49Updated last year
- An LLM Based Diagnosis System (https://arxiv.org/pdf/2312.01454.pdf)☆677Updated 8 months ago
- ai4db and db4ai work☆809Updated 10 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆408Updated 2 months ago
- ☆21Updated 5 months ago
- Fine-Tuning Dataset Auto-Generation for Graph Query Languages.☆82Updated last week
- [VLDB' 25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.☆381Updated 2 months ago
- Collection of training data management explorations for large language models☆334Updated last year
- The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT☆156Updated last year
- 🏆 Winning NeurIPS (NIPS) Competition Track: Big ANN, Practical Vector Search Challenge 2023. (see big-ann-benchmark https://big-ann-benc…☆30Updated last year
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆298Updated 8 months ago
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆32Updated 7 months ago
- RSL-SQL: Robust Schema Linking in Text-to-SQL Generation☆149Updated 2 months ago