xlang-ai / Spider2
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
☆343Updated this week
Alternatives and similar repositories for Spider2:
Users that are interested in Spider2 are comparing it to the libraries listed below
- Contextual Harnessing for Efficient SQL Synthesis☆175Updated 3 months ago
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.☆281Updated 2 months ago
- "GraphAgent: Agentic Graph Language Assistant"☆267Updated 3 weeks ago
- GraphRAG-survey: A curated list of resources on graph-based retrieval-augmented generation.☆602Updated this week
- A MULTI-GENERATOR ENSEMBLE FRAMEWORK FOR NATURAL LANGUAGE TO SQL☆413Updated this week
- Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?” (VLDB'24)☆94Updated 5 months ago
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆226Updated this week
- STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)☆300Updated 2 months ago
- This is a continuously updated handbook for readers to easily track the latest NL2SQL (Text-to-SQL) techniques in the literature and prov…☆391Updated this week
- ☆99Updated 10 months ago
- [ACL Findings 2024] Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm☆35Updated 6 months ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆291Updated 4 months ago
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a…☆287Updated 3 months ago
- A efficient and effective few-shot NL2SQL method on GPT-4.☆492Updated 8 months ago
- ☆53Updated 4 months ago
- This is the repository for the Tool Learning survey.☆315Updated 2 weeks ago
- The source code of CodeS (SIGMOD 2024).☆157Updated 3 months ago
- "AnyGraph: Graph Foundation Model in the Wild"☆207Updated 5 months ago
- TAG-Bench: A benchmark for table-augmented generation (TAG)☆679Updated 2 weeks ago
- ☆353Updated 11 months ago
- AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning (NeurIPS 2024)☆181Updated last week
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆225Updated 6 months ago
- The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT☆138Updated 6 months ago
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆108Updated 2 months ago
- Corrective Retrieval Augmented Generation☆349Updated 4 months ago
- Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database: via Schema Routing" (EDBT 2025)☆68Updated last week
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆663Updated this week
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆146Updated 3 months ago
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆174Updated 5 months ago
- ☆246Updated 10 months ago