fdabench / FDAbenchLinks
FDABench is the first data agent benchmark specifically designed for evaluating agents in multi-source data analytical scenarios.
☆53Updated this week
Alternatives and similar repositories for FDAbench
Users that are interested in FDAbench are comparing it to the libraries listed below
Sorting:
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆854Updated this week
- Benchmark for automated failure attributions in agentic systems (🏆 ICML 2025 Spotlight)☆337Updated 2 weeks ago
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆144Updated 3 weeks ago
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆370Updated last month
- OpenLens AI: A Fully Autonomous Multimodal Research Agent| OpenLens AI:全自主多模态科研智能体☆221Updated this week
- an unstructured data analytics systems via LLM☆23Updated 5 months ago
- The official GitHub page for the survey paper "Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond". And this pape…☆243Updated 5 months ago
- ☆87Updated 10 months ago
- LLM-based Dialect Translation System☆75Updated 4 months ago
- Official Repository of "LLM × DATA" Survey Paper☆672Updated this week
- [ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation☆180Updated 2 months ago
- A Systematic Survey of Deep Research☆287Updated last month
- Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning☆129Updated 7 months ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆30Updated 4 months ago
- Comprehensive tools and frameworks for developing foundation models tailored to recommendation systems.☆1,046Updated 4 months ago
- Fine-Tuning Dataset Auto-Generation for Graph Query Languages.☆87Updated 2 months ago
- CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Lear…☆126Updated 2 months ago
- Official code for ACL2025 "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"☆210Updated last month
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆185Updated last year
- ☆59Updated last year
- ☆26Updated 8 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆374Updated 2 weeks ago
- 基于RAG的知识问答系统,主要结合了 LLM、Langchain、提示工程、优化知识库结构和检索生成流程、vllm 推理优化框架等技术☆22Updated 10 months ago
- 记录我在cs336学习时的笔记和作业☆529Updated last week
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆123Updated 2 months ago
- A LLM semantic caching system aiming to enhance user experience by reducing response time via cached query-result pairs.☆955Updated 7 months ago
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”☆140Updated 4 months ago
- 💡 Awesome RAG: A resource of Retrieval-Augmented Generation (RAG) for LLMs, focusing on the development of technology.☆418Updated 2 weeks ago
- Trainable fast and memory-efficient sparse attention☆519Updated 2 weeks ago
- DataMosaic: Explainable and Verifiable Document-Based Data Analytics☆20Updated 7 months ago