weAIDB / awesome-data-llm
Official Repository of "LLM × DATA" Survey Paper
☆609Updated last week
Alternatives and similar repositories for awesome-data-llm
Users who are interested in awesome-data-llm are comparing it to the libraries listed below
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆333Updated last week
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆139Updated 2 months ago
- An unstructured data analytics system via LLM☆22Updated 4 months ago
- GPTuner is a manual-reading database tuning system leveraging domain knowledge automatically and extensively to enhance knob tuning proces…☆120Updated 5 months ago
- The source code of CodeS (SIGMOD 2024).☆194Updated last year
- Fine-Tuning Dataset Auto-Generation for Graph Query Languages.☆84Updated last month
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆139Updated 2 months ago
- A live reading list for LLM data synthesis (updated through July 2025).☆431Updated 4 months ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆29Updated 3 months ago
- AI4DB and DB4AI work☆814Updated last year
- Vector Retrieval and RAG in Practice: Techniques, Implementation, and Applications☆144Updated last year
- Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".☆21Updated 5 months ago
- Collection of training data management explorations for large language models☆336Updated last year
- DataMosaic: Explainable and Verifiable Document-Based Data Analytics☆20Updated 6 months ago
- ☆95Updated last year
- Contextual Harnessing for Efficient SQL Synthesis☆255Updated 7 months ago
- PilotScope is a middleware to bridge the gaps of deploying AI4DB (Artificial Intelligence for Databases) algorithms into actual database …☆165Updated last year
- An LLM Based Diagnosis System (https://arxiv.org/pdf/2312.01454.pdf)☆685Updated 10 months ago
- 🏆 Winning NeurIPS (NIPS) Competition Track: Big ANN, Practical Vector Search Challenge 2023. (see big-ann-benchmark https://big-ann-benc…☆30Updated last year
- ☆51Updated last year
- This project provides a demo for text-to-SQL based on CodeS.☆57Updated last year
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,223Updated last week
- ☆29Updated last month
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆117Updated last month
- [ICDE 2024] VDTuner - Automated Performance Tuning for Vector Data Management Systems (Vector Databases)☆33Updated last year
- Survey on LLM Agents (Published at COLING 2025)☆450Updated 2 months ago
- [VLDB'25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.☆405Updated 3 months ago
- A System for Optimized Semantic Computation☆181Updated this week
- A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration☆114Updated 4 months ago
- Papers for database systems powered by artificial intelligence (machine learning for database)☆766Updated last week