flexudy-pipe / sentence-doctor
Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spaCy provide state-of-the-art SBD, they often depend on text extractors (e.g., PDF text extractors or OCR). The quality of these extractors greatly influences the quality of SBD libraries and, as a consequence, the performance of downst…
☆62Updated 5 years ago
Alternatives and similar repositories for sentence-doctor
Users who are interested in sentence-doctor are comparing it to the libraries listed below.
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Use BERT to Fill in the Blanks☆84Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 11 months ago
- Dynamic ensemble decoding with transformer-based models☆29Updated 2 years ago
- An embeddable annotation tool for end-to-end cross-document coreference☆42Updated 2 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆66Updated 4 years ago
- ☆66Updated 5 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- Semantic search using Transformers and others☆110Updated 5 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆56Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated last month
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 2 years ago
- Viewer for the 🤗 datasets library.☆86Updated 4 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Updated last year
- QED: A Framework and Dataset for Explanations in Question Answering☆119Updated 4 years ago
- AI apps/benchmark for legaltech☆112Updated 4 years ago
- Using BERT for Conditional Natural Language Generation by fine-tuning pre-trained BERT on a custom dataset.☆41Updated 5 years ago
- Code for obtaining the Curation Corpus abstractive text summarisation dataset☆127Updated 5 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆100Updated 2 years ago
- xfspell — the Transformer Spell Checker☆189Updated 5 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35Updated 5 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆78Updated 3 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆55Updated 4 years ago
- Preprocessing Library for Natural Language Processing☆166Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year