flexudy-pipe / sentence-doctor
Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of the art SBD, they often depend on text extractors (e.g pdf text extractors or OCR). The quality of these extractors greatly influence the quality of SBD libraries and as a consequence, the performance of downst…
☆61Updated 4 years ago
Alternatives and similar repositories for sentence-doctor:
Users that are interested in sentence-doctor are comparing it to the libraries listed below
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated 2 years ago
- ☆66Updated 4 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆115Updated 3 years ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- Use BERT to Fill in the Blanks☆82Updated 3 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated last year
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 3 years ago
- Massively Multilingual Transfer for NER☆85Updated 3 years ago
- ☆74Updated 3 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆68Updated 3 years ago
- A embed able annotation tool for end to end cross document co-reference☆41Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- Code to reproduce the experiments from the paper.☆101Updated last year
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆54Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 2 months ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated 9 months ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated last year
- Use-cases of Hugging Face's BERT (e.g. paraphrase generation, unsupervised extractive summarization).☆20Updated 5 years ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆55Updated 2 years ago
- Topic Inference with Zeroshot models☆61Updated last year
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆79Updated 2 years ago
- Viewer for the 🤗 datasets library.☆84Updated 3 years ago