ruiqi-zhong / TestSuiteEval
"Semantic Evaluation for Text-to-SQL with Distilled Test Suite", EMNLP2020
☆36Updated 4 years ago
Alternatives and similar repositories for TestSuiteEval:
Users that are interested in TestSuiteEval are comparing it to the libraries listed below
- ☆17Updated 3 years ago
- Data and Code Release for "On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries"☆52Updated 4 years ago
- Code for the EMNLP 2020 paper "Re-examining the Role of Schema Linking in Text-to-SQL".☆28Updated 4 years ago
- ☆31Updated 3 years ago
- Source code for Grounded Adaptation for Zero-shot Executable Semantic Parsing☆21Updated 4 years ago
- DuoRAT is a ServiceNow Research project that was started at Element AI.☆56Updated last year
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆83Updated last year
- ☆45Updated 2 years ago
- A zero-shot neural semantic parser without using annotated parallel training data.☆9Updated 2 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- code for the NAACL 2021 paper Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention by Microsoft S…☆11Updated last year
- ☆17Updated 4 years ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion☆13Updated last year
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆77Updated last week
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated last year
- ☆85Updated last year
- The source code of paper "Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph" in KDD2022.☆14Updated 2 years ago
- Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing☆42Updated 4 years ago
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Updated 2 years ago
- ☆82Updated last year
- The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Te…☆32Updated 3 years ago
- Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022)☆57Updated last year
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆75Updated 2 months ago
- ☆14Updated 3 years ago
- ☆16Updated 3 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆77Updated 10 months ago
- The code and data used for EACL2023 Paper: "Large Language Models are few(1)-shot Table Reasoners"☆41Updated 9 months ago
- ☆28Updated 11 months ago
- ☆44Updated last year
- ☆30Updated last year