weAIDB / awesome-data-llmLinks
Official Repository of "LLM × DATA" Survey Paper
☆688Updated 2 weeks ago
Alternatives and similar repositories for awesome-data-llm
Users that are interested in awesome-data-llm are comparing it to the libraries listed below
Sorting:
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆388Updated last week
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆145Updated last month
- GPTuner is a manual-reading database tuning system leveraging domain knowlege automatically and extensively to enhance knob tuning proces…☆122Updated 7 months ago
- The source code of CodeS (SIGMOD 2024).☆195Updated last year
- an unstructured data analytics systems via LLM☆23Updated 6 months ago
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆140Updated 4 months ago
- ☆26Updated 8 months ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆30Updated 4 months ago
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,302Updated last week
- Contextual Harnessing for Efficient SQL Synthesis☆258Updated 8 months ago
- [VLDB' 25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.☆421Updated 5 months ago
- A live reading list for LLM data synthesis (Updated to July, 2025).☆449Updated 5 months ago
- Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".☆22Updated 6 months ago
- A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration☆124Updated 6 months ago
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆125Updated 2 months ago
- ☆52Updated last year
- Fine-Tuning Dataset Auto-Generation for Graph Query Languages.☆90Updated last week
- DataMosaic: Explainable and Verifiable Document-Based Data Analytics☆20Updated 7 months ago
- The source code for the schema filter (question + schema only)☆47Updated last year
- Collection of training data management explorations for large language models☆337Updated last year
- This project provides a demo for text-to-SQL based on CodeS.☆57Updated last year
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆323Updated 11 months ago
- The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT☆160Updated last year
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆32Updated 10 months ago
- ☆94Updated last year
- ☆15Updated 2 weeks ago
- An Efficient "Factory" to Build Multiple LoRA Adapters☆370Updated last year
- We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理☆606Updated last month
- Survey on LLM Agents (Published on CoLing 2025)☆475Updated 4 months ago
- [ICDE 2024] VDTuner - Automated Performance Tuning for Vector Data Management Systems (Vector Databases)☆34Updated last year