cfiltnlp / HiNER
This repository contains the HiNER dataset released with our paper at LREC 2022
☆15 · Updated last year
Related projects
Alternatives and complementary repositories for HiNER
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020… ☆32 · Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆82 · Updated last month
- Multilingual abstractive summarization dataset extracted from WikiHow. ☆84 · Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 · Updated last year
- A benchmark for code-switched NLP, ACL 2020 ☆74 · Updated 5 months ago
- Statistics on multilingual datasets ☆17 · Updated 2 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT ☆21 · Updated last year
- Codebase, data and models for the Keep it Simple paper at ACL 2021 ☆36 · Updated last year
- A library of translation-based text similarity measures ☆25 · Updated 11 months ago
- Curated list of publicly available parallel corpora for Indian Languages ☆30 · Updated 3 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization. ☆34 · Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly. ☆46 · Updated last year
- Zero-shot Transfer Learning from English to Arabic ☆29 · Updated 2 years ago
- ☆73 · Updated 3 years ago
- Dataset of ML and NLP papers ☆35 · Updated 2 years ago
- Code and models used in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆97 · Updated last year
- Collection of NLP model explanations and accompanying analysis tools ☆145 · Updated last year
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences ☆62 · Updated 6 months ago
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages ☆7 · Updated 9 months ago
- NTREX -- News Test References for MT Evaluation ☆75 · Updated 5 months ago
- Lexical Simplification with Pretrained Encoders ☆69 · Updated 3 years ago
- Repro is a library for easily running code from published papers via Docker. ☆40 · Updated last year
- ☆13 · Updated 2 years ago
- Code Repository for the IndicXNLI paper. ☆14 · Updated last year
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus ☆13 · Updated 5 years ago
- Automatically detect errors in annotated corpora. ☆47 · Updated last year
- Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award) ☆34 · Updated last year
- This code provides a word-level language identification tool for identifying the language of individual words in Code-Mixed text. e.g. The tex… ☆50 · Updated 4 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆139 · Updated 2 years ago
- Dual Encoders for State-of-the-art Natural Language Processing. ☆61 · Updated 2 years ago