xliuhw / NLU-Evaluation-Data
Copora for evaluating NLU Services/Platforms such as Dialogflow, LUIS, Watson, Rasa etc.
☆110Updated 2 years ago
Alternatives and similar repositories for NLU-Evaluation-Data:
Users that are interested in NLU-Evaluation-Data are comparing it to the libraries listed below
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)☆205Updated 3 years ago
- A collection of task-specific NLU datasets☆148Updated 2 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 3 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆283Updated last year
- Massively Multilingual Transfer for NER☆85Updated 3 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.☆78Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- ☆56Updated 3 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated 2 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆140Updated 2 years ago
- Please see the readme file as well as our 2019 EMNLP paper linked here -->☆198Updated 9 months ago
- BERTserini☆25Updated 2 years ago
- Source code to reproduce results of our paper "DIET: Lightweight Language Understanding for Dialogue Systems"☆61Updated 4 years ago
- datasets of natural language understanding and dialogue state tracking☆144Updated 4 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 4 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)☆93Updated 2 years ago
- ☆76Updated 2 years ago
- This repo includes extensions to the Stanford Dialogue Corpus. It contains crowd-sourced rewrites to facilitate research in dialogue stat…☆89Updated 5 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆151Updated 2 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- Evidence-based QA system for community question answering.☆105Updated 3 years ago
- Official repository for "Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems"☆68Updated 3 years ago
- Multilingual Dialogue Datasets☆19Updated 2 years ago
- ☆66Updated 4 years ago
- The implementation of the paper "Evaluating Coherence in Dialogue Systems using Entailment"☆74Updated 5 months ago
- Dialog State Tracking Challenge 2 & 3 Data☆85Updated 2 years ago
- This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empiric…☆125Updated last year
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago