weAIDB / awesome-data-llm
Official Repository of "LLM × DATA" Survey Paper
☆580Updated last month
Alternatives and similar repositories for awesome-data-llm
Users that are interested in awesome-data-llm are comparing it to the libraries listed below
- An unstructured data analytics system via LLM☆21Updated 4 months ago
- GPTuner is a manual-reading database tuning system leveraging domain knowledge automatically and extensively to enhance knob tuning proces…☆120Updated 5 months ago
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆131Updated last month
- The source code of CodeS (SIGMOD 2024).☆194Updated last year
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆317Updated this week
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆28Updated 2 months ago
- Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".☆20Updated 4 months ago
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,188Updated 3 weeks ago
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆137Updated 2 months ago
- DataMosaic: Explainable and Verifiable Document-Based Data Analytics☆20Updated 5 months ago
- Contextual Harnessing for Efficient SQL Synthesis☆254Updated 6 months ago
- PostgreSQL extension for supporting deep learning model inference within the database and vector storage☆57Updated 2 months ago
- PilotScope is a middleware to bridge the gaps of deploying AI4DB (Artificial Intelligence for Databases) algorithms into actual database …☆166Updated last year
- Collection of training data management explorations for large language models☆336Updated last year
- An LLM Based Diagnosis System (https://arxiv.org/pdf/2312.01454.pdf)☆680Updated 9 months ago
- [VLDB'25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.☆395Updated 3 months ago
- 🏆 Winning NeurIPS (NIPS) Competition Track: Big ANN, Practical Vector Search Challenge 2023. (see big-ann-benchmark https://big-ann-benc…☆30Updated last year
- A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration☆113Updated 4 months ago
- ☆50Updated last year
- LLM-based Dialect Translation System☆74Updated 2 months ago
- The source code for the schema filter (question + schema only)☆48Updated last year
- Vector Retrieval and RAG in Practice: Techniques, Implementation, and Applications☆141Updated last year
- A live reading list for LLM data synthesis (Updated to July, 2025).☆420Updated 3 months ago
- AI4DB and DB4AI work☆813Updated 11 months ago
- This project provides a demo for text-to-SQL based on CodeS.☆57Updated last year
- [ICDE 2024] VDTuner - Automated Performance Tuning for Vector Data Management Systems (Vector Databases)☆34Updated last year
- ☆94Updated last year
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆306Updated 9 months ago
- Fine-Tuning Dataset Auto-Generation for Graph Query Languages.☆83Updated 3 weeks ago
- ☆28Updated 3 weeks ago