cfiltnlp / HiNER
This repository contains the HiNER dataset released with our paper at LREC 2022
☆14 Updated last year
Alternatives and similar repositories for HiNER:
Users who are interested in HiNER are comparing it to the repositories listed below.
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020… ☆33 Updated 4 years ago
- A benchmark for code-switched NLP, ACL 2020 ☆74 Updated 10 months ago
- Collection of NLP model explanations and accompanying analysis tools ☆145 Updated last year
- Code Repository for the IndicXNLI paper. ☆15 Updated last year
- This repository hosts my experiments for the project I did with OffNote Labs. ☆10 Updated 4 years ago
- Statistics on multilingual datasets ☆17 Updated 2 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆55 Updated 2 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization. ☆34 Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 Updated 2 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021 ☆39 Updated last year
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow. ☆90 Updated last month
- ☆75 Updated 3 years ago
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre… ☆33 Updated 3 years ago
- This repository is dedicated to development of code-mixed language resources. ☆24 Updated last year
- Curriculum training ☆17 Updated last month
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio… ☆44 Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆106 Updated last year
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences ☆62 Updated 11 months ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT ☆21 Updated last year
- Dataset of sentences from Hindi stories tagged with different emotion tags ☆10 Updated 5 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages ☆46 Updated 2 years ago
- Automatically detect errors in annotated corpora. ☆47 Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 Updated 2 years ago
- GupShup: Summarizing Open-Domain Code-Switched Conversations EMNLP 2021 ☆15 Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆86 Updated last week
- A library of translation-based text similarity measures ☆25 Updated last year
- QED: A Framework and Dataset for Explanations in Question Answering ☆116 Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆153 Updated 10 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆75 Updated 3 years ago