flexudy-pipe / sentence-doctorLinks
Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of the art SBD, they often depend on text extractors (e.g pdf text extractors or OCR). The quality of these extractors greatly influence the quality of SBD libraries and as a consequence, the performance of downst…
☆61Updated 4 years ago
Alternatives and similar repositories for sentence-doctor
Users that are interested in sentence-doctor are comparing it to the libraries listed below
Sorting:
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Semantic search using Transformers and others☆110Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 7 months ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 3 years ago
- A embed able annotation tool for end to end cross document co-reference☆42Updated 2 years ago
- ☆66Updated 5 years ago
- Use BERT to Fill in the Blanks☆83Updated 3 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆38Updated 4 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- Viewer for the 🤗 datasets library.☆84Updated 3 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆57Updated last year
- Using BERT for doing the task of Conditional Natural Language Generation by fine-tuning pre-trained BERT on custom dataset.☆41Updated 5 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated 2 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- ⛔ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.☆88Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated last year
- Open source library for few shot NLP☆78Updated 2 years ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 3 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆54Updated 3 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- QED: A Framework and Dataset for Explanations in Question Answering☆117Updated 3 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated last year
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆34Updated 5 years ago
- Introduction to the recently released T5 model from the paper - Exploring the Limits of Transfer Learning with a Unified Text-to-Text Tra…☆35Updated 5 years ago