flexudy-pipe / sentence-doctorLinks
Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of the art SBD, they often depend on text extractors (e.g pdf text extractors or OCR). The quality of these extractors greatly influence the quality of SBD libraries and as a consequence, the performance of downst…
☆62Updated 5 years ago
Alternatives and similar repositories for sentence-doctor
Users that are interested in sentence-doctor are comparing it to the libraries listed below
Sorting:
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Semantic search using Transformers and others☆110Updated 5 years ago
- Use BERT to Fill in the Blanks☆84Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated last year
- A embed able annotation tool for end to end cross document co-reference☆42Updated 2 years ago
- AI apps/benchmark for legaltech☆112Updated 4 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆66Updated 4 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 5 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 3 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆55Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 2 months ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45Updated last year
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆56Updated 2 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated 2 years ago
- ☆66Updated 5 years ago
- Using BERT for doing the task of Conditional Natural Language Generation by fine-tuning pre-trained BERT on custom dataset.☆41Updated 5 years ago
- Viewer for the 🤗 datasets library.☆86Updated 4 years ago
- Preprocessing Library for Natural Language Processing☆166Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆100Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset☆21Updated 3 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35Updated 5 years ago
- Named entity recognizer based on ELMo or BERT as feature extractor and CRF as final classifier☆80Updated 2 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago