awslabs / unified-text2sql-benchmarkLinks
UNITE: A Unified Benchmark for Text-to-SQL Evaluation
☆82Updated 6 months ago
Alternatives and similar repositories for unified-text2sql-benchmark
Users that are interested in unified-text2sql-benchmark are comparing it to the libraries listed below
Sorting:
- ☆133Updated last month
- ☆401Updated last year
- The prediction results of ChatGPT on various datasets of Text-to-SQL.☆102Updated 2 years ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites☆311Updated last year
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA☆40Updated 2 years ago
- ☆58Updated last year
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"☆33Updated 2 years ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".☆132Updated last year
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆311Updated 10 months ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.☆167Updated last year
- Numbers Station Text to SQL model code.☆254Updated 2 years ago
- Evaluate the accuracy of LLM generated outputs☆724Updated 4 months ago
- The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT☆159Updated last year
- Comprehensive benchmark for RAG☆249Updated 6 months ago
- This repository contains all the code for the DTS-SQL paper☆53Updated last year
- RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Langua…☆406Updated 7 months ago
- Using Large Language Models (LLMs) to convert natural language queries to sql☆54Updated last year
- Contextual Harnessing for Efficient SQL Synthesis☆254Updated 7 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆283Updated 2 years ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆167Updated last year
- The source code of CodeS (SIGMOD 2024).☆194Updated last year
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆68Updated last year
- Automated Evaluation of RAG Systems☆681Updated 8 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆561Updated last week
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆193Updated last year
- The Pytorch implementation of RESDSQL (AAAI 2023).☆273Updated last year
- Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)☆129Updated 4 months ago
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆45Updated 2 years ago
- ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor☆299Updated 2 years ago
- ☆114Updated last year