awslabs / unified-text2sql-benchmarkLinks
UNITE: A Unified Benchmark for Text-to-SQL Evaluation
☆79Updated 3 months ago
Alternatives and similar repositories for unified-text2sql-benchmark
Users that are interested in unified-text2sql-benchmark are comparing it to the libraries listed below
Sorting:
- ☆105Updated last month
- ☆379Updated last year
- The prediction results of ChatGPT on various datasets of Text-to-SQL.☆102Updated 2 years ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites☆294Updated last year
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA☆38Updated 2 years ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.☆165Updated 10 months ago
- RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Langua…☆392Updated 4 months ago
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆281Updated 6 months ago
- ☆50Updated 10 months ago
- The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT☆154Updated last year
- Comprehensive benchmark for RAG☆215Updated 3 months ago
- Evaluate the accuracy of LLM generated outputs☆708Updated last month
- Numbers Station Text to SQL model code.☆250Updated 2 years ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆534Updated this week
- Contextual Harnessing for Efficient SQL Synthesis☆235Updated 3 months ago
- Automated Evaluation of RAG Systems☆656Updated 5 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".☆131Updated last year
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆177Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆277Updated 2 years ago
- Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)☆117Updated 3 weeks ago
- The source code for the schema filter (question + schema only)☆47Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆161Updated last year
- Benchmark baseline for retrieval qa applications☆116Updated last year
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)☆365Updated 5 months ago
- The Pytorch implementation of RESDSQL (AAAI 2023).☆264Updated last year
- [ACL Findings 2024] Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm☆44Updated last year
- Using Large Language Models (LLMs) to convert natural language queries to sql☆50Updated 11 months ago
- The source code of CodeS (SIGMOD 2024).☆184Updated 10 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆67Updated last year
- Benchmarking library for RAG☆226Updated 2 months ago