flexudy-pipe / sentence-doctor
Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of the art SBD, they often depend on text extractors (e.g pdf text extractors or OCR). The quality of these extractors greatly influence the quality of SBD libraries and as a consequence, the performance of downst…
☆61Updated 4 years ago
Alternatives and similar repositories for sentence-doctor:
Users that are interested in sentence-doctor are comparing it to the libraries listed below
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- ☆66Updated 4 years ago
- A embed able annotation tool for end to end cross document co-reference☆42Updated 2 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 4 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆116Updated 3 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 4 months ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP☆108Updated 3 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated last year
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆62Updated 3 years ago
- ☆33Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- reference pytorch code for intent classification☆44Updated 5 months ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated 9 months ago
- Use BERT to Fill in the Blanks☆82Updated 3 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated 11 months ago
- ☆56Updated 3 years ago
- Code for A Hierarchical Model for Data-to-Text Generation (Rebuffel, Soulier, Scoutheeten, Gallinari; ECIR 2020)☆81Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- Introduction to the recently released T5 model from the paper - Exploring the Limits of Transfer Learning with a Unified Text-to-Text Tra…☆35Updated 4 years ago
- Code to reproduce the experiments from the paper.☆100Updated last year
- Formate converter from one type of qa task datasets to another type☆39Updated 6 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- Build a dialog dataset from online books in many languages☆72Updated 2 years ago