flexudy-pipe / sentence-doctorLinks
Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of the art SBD, they often depend on text extractors (e.g pdf text extractors or OCR). The quality of these extractors greatly influence the quality of SBD libraries and as a consequence, the performance of downst…
☆61Updated 5 years ago
Alternatives and similar repositories for sentence-doctor
Users that are interested in sentence-doctor are comparing it to the libraries listed below
Sorting:
- Use BERT to Fill in the Blanks☆83Updated 3 years ago
- Semantic search using Transformers and others☆110Updated 5 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45Updated last year
- AI apps/benchmark for legaltech☆112Updated 4 years ago
- ☆66Updated 5 years ago
- A embed able annotation tool for end to end cross document co-reference☆42Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 9 months ago
- Viewer for the 🤗 datasets library.☆85Updated 4 years ago
- Using BERT for doing the task of Conditional Natural Language Generation by fine-tuning pre-trained BERT on custom dataset.☆41Updated 5 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆67Updated 4 years ago
- Preprocessing Library for Natural Language Processing☆166Updated 2 years ago
- Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020)☆32Updated 2 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆55Updated 3 years ago
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆56Updated 2 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆117Updated 4 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Updated last year
- Dynamic ensemble decoding with transformer-based models☆29Updated 2 years ago
- Code for obtaining the Curation Corpus abstractive text summarisation dataset☆127Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 5 months ago
- Open source library for few shot NLP☆79Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago