Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
☆513Apr 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for awesome-data-agents
Users who are interested in awesome-data-agents are comparing it to the repositories listed below.
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆32Apr 13, 2025Updated last year
- Official repository for the paper “HAIChart: Human and AI Paired Visualization System” (VLDB'24)☆34Nov 4, 2024Updated last year
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆156Jan 7, 2026Updated 3 months ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆33Sep 22, 2025Updated 6 months ago
- Official repository for the paper “ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering” (EMN…☆22Nov 16, 2024Updated last year
- An unstructured data analytics system via LLM☆24Aug 7, 2025Updated 8 months ago
- Official repository for the paper “Learned Data-aware Image Representations of Line Charts for Similarity Search” (SIGMOD'23)☆13Jan 17, 2024Updated 2 years ago
- Official Repository of "LLM × DATA" Survey Paper☆766Mar 24, 2026Updated 3 weeks ago
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆141Oct 2, 2025Updated 6 months ago
- 🔥 [NeurIPS'25] nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning☆23Nov 13, 2025Updated 5 months ago
- 🔥[VLDB'26] Official repository for the paper "LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning".☆109Jun 3, 2025Updated 10 months ago
- 🔥[NeurIPS'25] DeepFund: Pilot for Your Next Fund Investment☆265Mar 18, 2026Updated last month
- Efficient Hyper-parameter Tuning at Scale (VLDB'22)☆10Dec 1, 2021Updated 4 years ago
- ncNet is a Transformer-based model for supporting NL2VIS.☆44Sep 9, 2024Updated last year
- This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…☆1,404Apr 1, 2026Updated 2 weeks ago
- A Structured Grammar for Chart Annotation☆15May 8, 2025Updated 11 months ago
- ☆20Jan 26, 2026Updated 2 months ago
- ☆12Jul 11, 2025Updated 9 months ago
- This is the reading list of Large Language Model-Based Data Science Agent☆40Nov 3, 2025Updated 5 months ago
- Putting Database Meeting Reports Together☆35Nov 10, 2023Updated 2 years ago
- Vega-Zero is a sequence-based visualization grammar based on Vega-Lite.☆24Sep 9, 2024Updated last year
- ☆24Aug 29, 2025Updated 7 months ago
- ☆15Nov 7, 2024Updated last year
- ☆88Aug 11, 2021Updated 4 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆17Aug 15, 2025Updated 8 months ago
- [ICLR/AAAI 2026] Open-Source LLM-Based Data Analysis Agents☆79Jan 26, 2026Updated 2 months ago
- FDABench, a benchmark for evaluating data agents' reasoning ability over heterogeneous data in analytical scenarios.☆57Feb 18, 2026Updated 2 months ago
- [Archived] Move to agentsociety☆17Feb 6, 2025Updated last year
- [VLDB 25] Maximum Inner Product is Query-Scaled Nearest Neighbor☆38Oct 31, 2025Updated 5 months ago
- Paper repository for "SWIRL: Selection of Workload-aware Indexes using Reinforcement Learning" (EDBT 2022)☆40Jul 12, 2025Updated 9 months ago
- APIs of DeepEye. DeepEye: Towards Automatic Data Visualization [ICDE 2018]☆164Sep 9, 2024Updated last year
- The source code for the schema filter (question + schema only)☆47May 13, 2024Updated last year
- A crowd-powered database system, with SQL-like query interface, multi-goal optimization☆11Sep 4, 2017Updated 8 years ago
- A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of yo…☆22Feb 7, 2025Updated last year
- Unsupervised Anomaly Detection System for Univariate Time Series☆20Sep 25, 2024Updated last year
- [IJCAI'24] Official code for our paper "Make Graph Neural Networks Great Again: A Generic Integration Paradigm of Topology-Free Patterns …☆15Jul 3, 2025Updated 9 months ago
- ☆45Jul 28, 2021Updated 4 years ago
- [AAAI'25] The implementation of paper "Federated Foundation Models on Heterogeneous Time Series" | The first work to explore time series …☆22Feb 2, 2026Updated 2 months ago
- DataFountain Epidemic Government Affairs Q&A Assistant: solution sharing☆16May 2, 2020Updated 5 years ago