Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
☆577Jun 4, 2026Updated this week
Alternatives and similar repositories for awesome-data-agents
Users that are interested in awesome-data-agents are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆32Apr 13, 2025Updated last year
- Official repository for the paper “HAIChart: Human and AI Paired Visualization System” (VLDB'24)☆34Nov 4, 2024Updated last year
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆158May 26, 2026Updated last week
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆34Sep 22, 2025Updated 8 months ago
- Officical repository for the paper“ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering”(EMN…☆22Nov 16, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- an unstructured data analytics systems via LLM☆28Aug 7, 2025Updated 10 months ago
- Official repository for the paper “Learned Data-aware Image Representations of Line Charts for Similarity Search” (SIGMOD'23)☆13Jan 17, 2024Updated 2 years ago
- Official Repository of "LLM × DATA" Survey Paper☆785Mar 24, 2026Updated 2 months ago
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆140Oct 2, 2025Updated 8 months ago
- 🔥 [NeurIPS'25] nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning☆24Nov 13, 2025Updated 6 months ago
- datasets for database research☆15Aug 25, 2023Updated 2 years ago
- 🔥[NeurIPS'25] DeepFund: Pilot for Your Next Fund Investment☆277Mar 18, 2026Updated 2 months ago
- Efficient Hyper-parameter Tuning at Scale (VLDB'22)☆10Dec 1, 2021Updated 4 years ago
- ncNet is a Transformer-based model for supporting NL2VIS.☆44Sep 9, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,485May 6, 2026Updated last month
- ☆20Jan 26, 2026Updated 4 months ago
- ☆13Jul 11, 2025Updated 10 months ago
- Putting Database Meeting Reports Together☆35Nov 10, 2023Updated 2 years ago
- [ICLR/AAAI/KDD2026] Open-Source LLM-Based Data Analysis Agents☆102Jun 1, 2026Updated last week
- Vega-Zero is a sequence-based visualization grammar based on Vega-Lite.☆24Sep 9, 2024Updated last year
- ☆24Aug 29, 2025Updated 9 months ago
- A repository for the Kramabench benchmark☆64Apr 13, 2026Updated last month
- ☆23Jul 25, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ArxivFlow - Periodic Track on arXiv Paper☆51May 8, 2026Updated last month
- [IJCAI'25 Workshop Oral] The 1st place solution of IJCAI 2025 challenge track 1: Image Detection and Localization☆37Dec 4, 2025Updated 6 months ago
- 🔥[KDD’26] Official repository for the paper “FDABench: A Benchmark for Data Agents on Analytical Queries over Heterogeneous Data”.☆66May 28, 2026Updated last week
- SEU Summer School project, based on Kotlin and Java.☆12Sep 15, 2023Updated 2 years ago
- [Archived] Move to agentsociety☆17Feb 6, 2025Updated last year
- [VLDB 25] Maximum Inner Product is Query-Scaled Nearest Neighbor☆40Oct 31, 2025Updated 7 months ago
- Awesome-Paper-list: Visualization meets LLM☆83Mar 26, 2026Updated 2 months ago
- ☆10Oct 28, 2020Updated 5 years ago
- ☆12May 29, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The source code for the schema filter (question + schema only)☆47May 13, 2024Updated 2 years ago
- Public Evaluation Result Archieve for BFCL☆30Dec 17, 2025Updated 5 months ago
- DeepEye: An Autonomous Data Agent System☆200Updated this week
- ☆62Jun 17, 2021Updated 4 years ago
- The tensorflow prototype of "Local Low-rank Matrix Approximation" (LLORMA)☆10Jan 11, 2019Updated 7 years ago
- ai4db and db4ai work☆822Dec 26, 2024Updated last year
- Must-read papers on network representation learning (NRL)/network embedding (NE)☆12Nov 9, 2017Updated 8 years ago