HKUSTDial/awesome-data-agents

Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"

☆647

Alternatives and similar repositories for awesome-data-agents

Users who are interested in awesome-data-agents are comparing it to the repositories listed below.

HKUSTDial / StatQA
🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”
☆32 · Apr 13, 2025 · Updated last year
HKUSTDial / HAIChart
Official repository for the paper “HAIChart: Human and AI Paired Visualization System” (VLDB'24)
☆36 · Nov 4, 2024 · Updated last year
HKUSTDial / DeepEye-SQL
🔥[SIGMOD'26] Official repository for the paper "DeepEye-SQL: A Software-Engineering-Inspired Text-to-SQL Framework"
☆78 · Jul 11, 2026 · Updated last week
zzhang393 / DataMosaic-1.0
DataMosaic: Explainable and Verifiable Document-Based Data Analytics
☆20 · Jun 30, 2025 · Updated last year
HKUSTDial / EllieSQL
Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".
☆25 · Jul 24, 2025 · Updated 11 months ago
HKUSTDial / Alpha-SQL
🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"
☆163 · Jun 10, 2026 · Updated last month
HKUSTDial / nvBench-2.0
🔥 [NeurIPS'25] nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning
☆26 · Nov 13, 2025 · Updated 8 months ago
HKUSTDial / DPC
🔥[ACL'26 (Main)] Official repository for the paper "DPC: Training-Free Text-to-SQL Candidate Selection via Dual-Paradigm Consistency"
☆15 · Apr 26, 2026 · Updated 2 months ago
HKUSTDial / NL2SQL_Handbook
This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide pract…
☆1,536 · May 6, 2026 · Updated 2 months ago
OpenDataBox / awesome-data-llm
Official Repository of "LLM × DATA" Survey Paper
☆803 · Jun 15, 2026 · Updated last month
HKUSTDial / NL2SQL360
🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”
☆141 · Oct 2, 2025 · Updated 9 months ago
DEFENSE-SEU / PeroMAS
☆24 · Apr 16, 2026 · Updated 3 months ago
HKUSTDial / VisJudgeBench
VisJudgeBench: A comprehensive benchmark for aesthetics and quality assessment of visualizations, featuring 3,090 expert-annotated sample…
☆122 · Feb 4, 2026 · Updated 5 months ago
HKUSTDial / NL2SQL-Bugs-Benchmark
🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.
☆34 · Sep 22, 2025 · Updated 9 months ago
DEFENSE-SEU / Sci-VLA
☆18 · Updated this week
FoundationAgents / foundation-protocol
A Python runtime for multi-entity AI collaboration: agents, humans, and tools on a shared protocol layer.
☆50 · Jun 18, 2026 · Updated last month
HKUSTDial / ChartInsights
Official repository for the paper “ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering” (EMN…
☆22 · Nov 16, 2024 · Updated last year
zjunlp / DataMind
[ICLR/AAAI/KDD2026] Open-Source LLM-Based Data Analysis Agents
☆121 · Jul 10, 2026 · Updated last week
HKUSTDial / ChartMark
A Structured Grammar for Chart Annotation
☆15 · May 8, 2025 · Updated last year
ruc-datalab / DeepAnalyze
DeepAnalyze is the first agentic LLM for autonomous data science. 🎈Your AI data analyst: automatically analyzes large volumes of data and generates professional analysis reports in one click!
☆4,380 · Jul 1, 2026 · Updated 2 weeks ago
DEFENSE-SEU / Code2MCP
Official Repo of "Code2MCP: Transforming Code Repositories into MCP Services", Scaling Environments for Agents Workshop @ NeurIPS 2025
☆131 · Nov 4, 2025 · Updated 8 months ago
THUDM / DataSciBench
DataSciBench: An LLM Agent Benchmark for Data Science (Findings of ACL 2026)
☆64 · Jan 21, 2026 · Updated 6 months ago
mitdbg / Kramabench
A repository for the Kramabench benchmark
☆68 · Updated this week
IatomicreactorI / CSGOTrading
This is the official GitHub repo for the CSGOTrading project.
☆134 · Jan 31, 2026 · Updated 5 months ago
fdabench / FDAbench
🔥[KDD’26] Official repository for the paper “FDABench: A Benchmark for Data Agents on Analytical Queries over Heterogeneous Data”.
☆75 · Updated this week
HKUSTDial / DataMagic
AI-powered data-to-video generation. Upload a table, get a narrated animated data story.
☆113 · Jul 7, 2026 · Updated 2 weeks ago
HKUSTDial / LineNet-and-LineBench-SIGMOD2023
Official repository for the paper “Learned Data-aware Image Representations of Line Charts for Similarity Search” (SIGMOD'23)
☆13 · Jan 17, 2024 · Updated 2 years ago
TsinghuaDatabaseGroup / nvBench
☆89 · Aug 11, 2021 · Updated 4 years ago
LiqiangJing / DSBench
[ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
☆125 · Aug 17, 2025 · Updated 11 months ago
Evanwu1125 / AutoWebWorld
☆25 · Jul 10, 2026 · Updated last week
Evanwu1125 / LiteCoT
☆17 · Jun 10, 2025 · Updated last year
code4DB / Index_EAB
☆13 · Jul 11, 2025 · Updated last year
DEFENSE-SEU / RobustFlow
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
☆238 · Oct 19, 2025 · Updated 9 months ago
ChartGalaxy / ChartGalaxy
☆245 · Apr 19, 2026 · Updated 3 months ago
HKUSTDial / MAR
Official repository for the paper “MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering” (EMNLP'24)
☆19 · Nov 10, 2024 · Updated last year
HKUSTDial / Vega-Zero
Vega-Zero is a sequence-based visualization grammar based on Vega-Lite.
☆24 · Sep 9, 2024 · Updated last year
Snowflake-Labs / ReFoRCE
A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration
☆139 · Aug 1, 2025 · Updated 11 months ago
DEFENSE-SEU / MCP-Github-Agent
Automatically convert GitHub repo into MCP service
☆14 · Sep 10, 2025 · Updated 10 months ago
TsinghuaDatabaseGroup / datasets
datasets for database research
☆15 · Aug 25, 2023 · Updated 2 years ago