Official Repository of "LLM × DATA" Survey Paper
☆794Jun 15, 2026Updated 2 weeks ago
Alternatives and similar repositories for awesome-data-llm
Users that are interested in awesome-data-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ai4db and db4ai work☆824Dec 26, 2024Updated last year
- ☆13Jul 11, 2025Updated 11 months ago
- Papers for database systems powered by artificial intelligence (machine learning for database)☆776Apr 21, 2026Updated 2 months ago
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆617Jun 10, 2026Updated 2 weeks ago
- Paper repository for "SWIRL: Selection of Workload-aware Indexes using Reinforcement Learning" (EDBT 2022)☆41Jul 12, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Aug 17, 2023Updated 2 years ago
- A prototype implementation of Bao for PostgreSQL☆221Sep 17, 2024Updated last year
- An LLM Based Diagnosis System (https://arxiv.org/pdf/2312.01454.pdf)☆709Dec 27, 2025Updated 6 months ago
- ☆21Jul 20, 2024Updated last year
- The DSB benchmark is designed for evaluating both workloaddriven and traditional database systems on modern decision support workloads. D…☆76Nov 8, 2024Updated last year
- A Benchmark for Transactional Database Performance Anomalies☆12Nov 21, 2023Updated 2 years ago
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,510May 6, 2026Updated last month
- An online logical query rewrite demo (schema+sql only)!☆40Jul 25, 2023Updated 2 years ago
- A Pretrained Model for Cross-Database Cardinality Estimation☆34Apr 30, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A new CardEst Benchmark to Bridge AI and DBMS☆136Mar 14, 2023Updated 3 years ago
- datasets for database research☆15Aug 25, 2023Updated 2 years ago
- ☆53Nov 26, 2024Updated last year
- Join Order Benchmark (implicit fork of https://github.com/gregrahn/join-order-benchmark)☆24Jun 9, 2026Updated 3 weeks ago
- Characterization of relational table embeddings (VLDB 2024).☆32Jul 1, 2024Updated last year
- A Vagrant box that automatically loads the IMDB dataset into Postgres☆82Mar 22, 2024Updated 2 years ago
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆32Apr 13, 2025Updated last year
- 🔥[KDD’26] Official repository for the paper “FDABench: A Benchmark for Data Agents on Analytical Queries over Heterogeneous Data”.☆70Jun 7, 2026Updated 3 weeks ago
- blah☆35May 5, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The source code of the Sudowoodo paper in ICDE 2023☆19May 24, 2023Updated 3 years ago
- Paper related to AI4DB techniques☆114Apr 14, 2026Updated 2 months ago
- ☆21Dec 2, 2025Updated 6 months ago
- A System for Optimized Semantic Computation☆228May 22, 2026Updated last month
- GPTuner is a manual-reading database tuning system leveraging domain knowlege automatically and extensively to enhance knob tuning proces…☆128Jul 3, 2025Updated 11 months ago
- [ICDE 2024] VDTuner - Automated Performance Tuning for Vector Data Management Systems (Vector Databases)☆35Apr 21, 2024Updated 2 years ago
- A Unified Transferable Model for ML-Enhanced DBMS☆14Feb 2, 2022Updated 4 years ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆34Sep 22, 2025Updated 9 months ago
- ☆10Nov 16, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Join Order Benchmark (JOB)☆362Feb 16, 2025Updated last year
- An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads☆31Oct 10, 2024Updated last year
- Implementation of our VLDB'22 paper "Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction"☆55Nov 11, 2022Updated 3 years ago
- 🔥[VLDB'26] Official repository for the paper "LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning".☆112Jun 3, 2025Updated last year
- Codes for building an AI-native database☆77Jul 29, 2024Updated last year
- LLM for Index Recommendation☆18Mar 20, 2026Updated 3 months ago
- Collection of training data management explorations for large language models☆341Aug 2, 2024Updated last year