flexudy-pipe / sentence-doctor
Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although excellent libraries like spaCy provide state-of-the-art SBD, they often depend on text extractors (e.g., PDF text extractors or OCR). The quality of these extractors greatly influences the quality of SBD libraries and, as a consequence, the performance of downst…
☆61Updated 4 years ago
Alternatives and similar repositories for sentence-doctor:
Users who are interested in sentence-doctor are comparing it to the libraries listed below.
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- ☆66Updated 4 years ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated last year
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 5 months ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 4 years ago
- ☆76Updated 3 years ago
- ☆64Updated 2 years ago
- reference pytorch code for intent classification☆44Updated 6 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- Use BERT to Fill in the Blanks☆82Updated 3 years ago
- ☆33Updated 3 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 4 years ago
- An embeddable annotation tool for end-to-end cross-document coreference☆42Updated 2 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 3 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated 2 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 3 years ago
- Format converter from one type of QA task dataset to another☆39Updated 6 years ago
- ⛔ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.☆88Updated 2 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated 11 months ago
- QED: A Framework and Dataset for Explanations in Question Answering☆116Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆36Updated 3 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- Open source library for few shot NLP☆78Updated last year
- This repository will contain the data and codes for WNUT 2020 NER task☆51Updated 2 years ago