Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of the art SBD, they often depend on text extractors (e.g pdf text extractors or OCR). The quality of these extractors greatly influence the quality of SBD libraries and as a consequence, the performance of downst…
☆62Sep 8, 2020Updated 5 years ago
Alternatives and similar repositories for sentence-doctor
Users that are interested in sentence-doctor are comparing it to the libraries listed below
Sorting:
- The goal of this project is to provide an easy to use open source tool for data labelling.☆17Jul 11, 2023Updated 2 years ago
- Multilingual AI style enhancement and grammar correction REST API. English, French, Spanish, Arabic, Japanese, Chinese. Based on deep NLP…☆10Dec 22, 2019Updated 6 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- Referring Expression Generation using Neural Networks☆22Dec 8, 2022Updated 3 years ago
- Unofficial implementation of Adaptive Input in PyTorch☆12Feb 22, 2019Updated 7 years ago
- Notebooks to accompany the blog posts about the 2nd place Kaggle RSNA winners: https://github.com/darraghdog/rsna☆30Jan 29, 2020Updated 6 years ago
- A Toolkit to Generate Structured Historical Documents☆15Jun 27, 2020Updated 5 years ago
- A text similarity computation using minhashing and Jaccard distance on reuters dataset☆17Jun 11, 2018Updated 7 years ago
- This repo contains the code for our paper "EditNTS: An Neural Programmer-Interpreter Model for Sentence Simplification through Explicit E…☆58Feb 19, 2020Updated 6 years ago
- A Sample repo using the Apriori and FP Growth algorithms to produce categories for queries, and BERT for PoP change visualization.☆40Apr 18, 2022Updated 3 years ago
- Machine Translation Metrics Unit TesTing☆13Jun 4, 2016Updated 9 years ago
- Bulk Copyscape is a script that utilizes Copyscape's API to by-pass the normal bulk upload queue, allowing you to quickly check websites …☆17Nov 13, 2022Updated 3 years ago
- FIGMENT☆15Jan 27, 2020Updated 6 years ago
- Using PubMed to find out how a gene contributes to addiction.☆20Dec 27, 2022Updated 3 years ago
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Oct 29, 2018Updated 7 years ago
- Using BERT for doing the task of Conditional Natural Language Generation by fine-tuning pre-trained BERT on custom dataset.☆41Feb 18, 2020Updated 6 years ago
- Semantic search using Transformers and others☆110Aug 27, 2020Updated 5 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"☆190May 23, 2025Updated 9 months ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- code for Question Condensing Networks for Answer Selection in Community Question Answering☆14Aug 26, 2018Updated 7 years ago
- A sentiment classifier on mixed language (and mixed script) reviews in Tamil, Malayalam and English☆17Apr 9, 2021Updated 4 years ago
- Grammatical Error Correction Based on Language Model(BERT, GPT-2), and Seq2Seq☆18Sep 5, 2019Updated 6 years ago
- Finetune multiple pre-trained Transformer-based models to solve Vietnamese Fake News Detection problem (ReINTEL) in VLSP2020 shared task☆18Dec 16, 2020Updated 5 years ago
- sequence tagging for NER for ULMFiT☆20Nov 4, 2020Updated 5 years ago
- xfspell — the Transformer Spell Checker☆189Jun 18, 2020Updated 5 years ago
- A systematic comparison between pipeline and end-to-end architectures in the RDF-to-text task☆19Feb 15, 2023Updated 3 years ago
- AI-free static security scanner for Claude Code artifacts (Skills, Hooks, MCP configs). Detects data exfiltration, prompt injection, and …☆17Updated this week
- Applying progressive resizing to building models in Keras.☆18Apr 28, 2019Updated 6 years ago
- Word sense disambiguation using contextualized word embedding☆17Dec 18, 2019Updated 6 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Apr 21, 2021Updated 4 years ago
- ☆207Nov 12, 2021Updated 4 years ago
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,267Mar 2, 2023Updated 3 years ago
- Coreference Model Experimentation (Tensorflow and Pytorch) : Mainly Using transfer learning and Transformer Model BERT☆22Oct 15, 2019Updated 6 years ago
- CoreNLG is an easy to use and productivity oriented Python library for Natural Language Generation. It aims to provide the essential tool…☆27Jul 9, 2021Updated 4 years ago
- LexNLP by LexPredict☆767May 27, 2024Updated last year
- Neural text-to-text question generation☆216Nov 13, 2020Updated 5 years ago
- Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.☆201May 26, 2024Updated last year
- Assorted tools and utility functions, mainly for doing NLP with Python☆23Sep 12, 2025Updated 5 months ago
- File repository for the course [Advanced Deep Learning with Keras]. Packt Publishing.☆29Feb 26, 2018Updated 8 years ago