bird-bench / bird-bench.github.io
☆20 Updated last week
Alternatives and similar repositories for bird-bench.github.io
Users who are interested in bird-bench.github.io are comparing it to the repositories listed below
- ☆141 Updated 3 months ago
- The source code of CodeS (SIGMOD 2024). ☆194 Updated last year
- 🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?” ☆140 Updated 4 months ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites ☆313 Updated last year
- A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration ☆123 Updated 6 months ago
- ☆59 Updated last year
- [ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows ☆720 Updated last week
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL ☆320 Updated 11 months ago
- The source code for the schema filter (question + schema only) ☆47 Updated last year
- This project provides a demo for text-to-SQL based on CodeS. ☆57 Updated last year
- This repository contains all the code for the DTS-SQL paper ☆54 Updated last year
- ☆404 Updated last year
- Contextual Harnessing for Efficient SQL Synthesis ☆258 Updated 8 months ago
- The prediction results of ChatGPT on various datasets of Text-to-SQL. ☆103 Updated 2 years ago
- The PyTorch implementation of RESDSQL (AAAI 2023). ☆275 Updated last year
- The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT ☆160 Updated last year
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search" ☆145 Updated last month
- ☆12 Updated 2 years ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables". ☆136 Updated last year
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation. ☆30 Updated 4 months ago
- The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering" ☆35 Updated last year
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code! ☆10 Updated last year
- ☆114 Updated last year
- ☆27 Updated 2 years ago
- LexEval: A Comprehensive Benchmark for Evaluating Large Language Models in Legal Domain ☆88 Updated last year
- An efficient and effective few-shot NL2SQL method on GPT-4. ☆616 Updated 11 months ago
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? ☆136 Updated last year
- UNITE: A Unified Benchmark for Text-to-SQL Evaluation ☆84 Updated 8 months ago
- Official Repository of "LLM × DATA" Survey Paper ☆688 Updated last week
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?” ☆32 Updated 9 months ago