ruiqi-zhong / TestSuiteEvalLinks
"Semantic Evaluation for Text-to-SQL with Distilled Test Suite", EMNLP2020
☆40Updated 4 years ago
Alternatives and similar repositories for TestSuiteEval
Users that are interested in TestSuiteEval are comparing it to the libraries listed below
Sorting:
- Code for the EMNLP 2020 paper "Re-examining the Role of Schema Linking in Text-to-SQL".☆28Updated 4 years ago
- ☆31Updated 3 years ago
- ☆18Updated 3 years ago
- Data and Code Release for "On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries"☆54Updated 4 years ago
- Source code for Grounded Adaptation for Zero-shot Executable Semantic Parsing☆21Updated 4 years ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆82Updated 2 years ago
- DuoRAT is a ServiceNow Research project that was started at Element AI.☆56Updated 2 years ago
- ☆82Updated 2 years ago
- ☆46Updated 2 years ago
- ☆131Updated last year
- Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing☆42Updated 4 years ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆83Updated 2 years ago
- ☆18Updated 3 years ago
- Using self-play to augment multi-turn text-to-SQL datasets☆11Updated 2 years ago
- scripts and baselines for SParC: Yale & Salesforce Semantic Parsing and Text-to-SQL in Context Challenge☆76Updated 2 years ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆73Updated 3 years ago
- Repository for Decomposed Prompting☆94Updated last year
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆91Updated 5 months ago
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022☆159Updated last year
- ☆15Updated 4 years ago
- ☆117Updated 2 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- ☆88Updated 2 years ago
- The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Te…☆31Updated 4 years ago
- The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023]☆47Updated last year
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆22Updated last year
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"☆31Updated 2 years ago
- A comprehensive paper list of Reasoning over Tables.☆28Updated 2 years ago
- A dataset of complex questions on semi-structured Wikipedia tables☆169Updated 4 years ago
- ☆84Updated 3 years ago