fdabench / FDAbenchLinks
FDABench, a benchmark for evaluating data agents' reasoning ability over heterogeneous data in analytical scenarios.
☆53Updated this week
Alternatives and similar repositories for FDAbench
Users that are interested in FDAbench are comparing it to the libraries listed below
Sorting:
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆875Updated this week
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆377Updated this week
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆145Updated last month
- Benchmark for automated failure attributions in agentic systems (🏆 ICML 2025 Spotlight)☆340Updated this week
- LLM-based Dialect Translation System☆76Updated 4 months ago
- Official Repository of "LLM × DATA" Survey Paper☆688Updated last week
- an unstructured data analytics systems via LLM☆23Updated 6 months ago
- A Systematic Survey of Deep Research☆299Updated last month
- Fine-Tuning Dataset Auto-Generation for Graph Query Languages.☆89Updated 2 months ago
- OpenLens AI: A Fully Autonomous Multimodal Research Agent| OpenLens AI:全自主多模态科研智能体☆233Updated this week
- Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering, ACL 2025☆18Updated 3 months ago
- [EMNLP 2025 Main] LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL☆60Updated 7 months ago
- Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning☆129Updated 8 months ago
- 💡 Awesome RAG: A resource of Retrieval-Augmented Generation (RAG) for LLMs, focusing on the development of technology.☆423Updated 3 weeks ago
- CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Lear…☆127Updated 2 months ago
- This is the official repo for GraphRAG-Bench: Challenging Domain-Specific Reasoning for Evaluating Graph Retrieval-Augmented Generation☆64Updated 6 months ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆30Updated 4 months ago
- In-depth study of the graphrag☆1,509Updated 7 months ago
- 五星大厨:全面Multi-Agent 的客服机器人,基于langraph实现,txt2sql ,txt2cypher, lightrag, 多模态 等☆122Updated 2 months ago
- Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".☆22Updated 6 months ago
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆125Updated 2 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆380Updated last week
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆32Updated 9 months ago
- LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505☆3,000Updated last month
- ☆87Updated 10 months ago
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆140Updated 4 months ago
- [arxiv: 2503.23895] Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement☆172Updated 5 months ago
- 记录我在cs336学习时的笔记和作业☆597Updated last week
- The official GitHub page for the survey paper "Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond". And this pape…☆244Updated 6 months ago
- A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration☆124Updated 6 months ago