ruiqi-zhong / TestSuiteEvalLinks
"Semantic Evaluation for Text-to-SQL with Distilled Test Suite", EMNLP2020
☆40Updated 4 years ago
Alternatives and similar repositories for TestSuiteEval
Users that are interested in TestSuiteEval are comparing it to the libraries listed below
Sorting:
- ☆31Updated 4 years ago
- ☆18Updated 3 years ago
- Data and Code Release for "On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries"☆54Updated 4 years ago
- Code for the EMNLP 2020 paper "Re-examining the Role of Schema Linking in Text-to-SQL".☆28Updated 4 years ago
- Source code for Grounded Adaptation for Zero-shot Executable Semantic Parsing☆21Updated 4 years ago
- DuoRAT is a ServiceNow Research project that was started at Element AI.☆56Updated 2 years ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆82Updated 2 years ago
- ☆82Updated 2 years ago
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"☆33Updated 2 years ago
- ☆46Updated 2 years ago
- ☆18Updated 3 years ago
- Repository for Decomposed Prompting☆95Updated last year
- A comprehensive paper list of Reasoning over Tables.☆29Updated 2 years ago
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022☆163Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆80Updated last year
- Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022)☆56Updated 2 years ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆73Updated 3 years ago
- ☆28Updated 4 years ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆83Updated 2 years ago
- The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Te…☆31Updated 4 years ago
- ☆121Updated 2 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆22Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆138Updated last year
- ☆84Updated 3 years ago
- Dataset and code for EMNLP2020 paper "HybridQA: A Dataset of Multi-Hop Question Answeringover Tabular and Textual Data"☆236Updated 2 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆94Updated 5 months ago
- ☆17Updated 4 years ago
- The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023]☆47Updated last year
- This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification.☆18Updated 2 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated 2 years ago