Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of the art SBD, they often depend on text extractors (e.g pdf text extractors or OCR). The quality of these extractors greatly influence the quality of SBD libraries and as a consequence, the performance of downstβ¦
β62Sep 8, 2020Updated 5 years ago
Alternatives and similar repositories for sentence-doctor
Users that are interested in sentence-doctor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- python project template for personal projects! πββοΈβ11Nov 28, 2020Updated 5 years ago
- Dead simple cron service for making HTTP calls on a regular schedule.β14Jul 11, 2020Updated 5 years ago
- numeric fused-head identification and resolutionβ33Oct 16, 2019Updated 6 years ago
- French Machine Reading for Question Answeringβ18Sep 21, 2022Updated 3 years ago
- Grammatical Error Correction Based on Language Model(BERT, GPT-2), and Seq2Seqβ18Sep 5, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.β14Jul 25, 2023Updated 2 years ago
- Semantic search using Transformers and othersβ110Aug 27, 2020Updated 5 years ago
- Implementation of Viterbi algorithm and Hidden Markov Model in C++β20Mar 17, 2017Updated 9 years ago
- β12Feb 22, 2021Updated 5 years ago
- A Toolkit to Generate Structured Historical Documentsβ15Jun 27, 2020Updated 5 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"β55Dec 2, 2021Updated 4 years ago
- Multilingual AI style enhancement and grammar correction REST API. English, French, Spanish, Arabic, Japanese, Chinese. Based on deep NLPβ¦β10Dec 22, 2019Updated 6 years ago
- Notebooks to accompany the blog posts about the 2nd place Kaggle RSNA winners: https://github.com/darraghdog/rsnaβ30Jan 29, 2020Updated 6 years ago
- Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655β21Jul 25, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluationβ63Oct 29, 2018Updated 7 years ago
- This repo contains the code for our paper "EditNTS: An Neural Programmer-Interpreter Model for Sentence Simplification through Explicit Eβ¦β58Feb 19, 2020Updated 6 years ago
- An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammaticalβ¦β28Jul 25, 2024Updated last year
- search-rattailcollagen1 created by GitHub Classroomβ10Jan 17, 2021Updated 5 years ago
- Repo for the FB AI Speech team.β25Aug 24, 2021Updated 4 years ago
- Referring Expression Generation using Neural Networksβ22Dec 8, 2022Updated 3 years ago
- A full-text error corrector for English based on transformers and deep learningβ10Jan 8, 2023Updated 3 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"β190May 23, 2025Updated 11 months ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.β462Mar 26, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- File repository for the course [Advanced Deep Learning with Keras]. Packt Publishing.β29Feb 26, 2018Updated 8 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in cβ¦β359Feb 22, 2022Updated 4 years ago
- A Sample repo using the Apriori and FP Growth algorithms to produce categories for queries, and BERT for PoP change visualization.β40Apr 18, 2022Updated 4 years ago
- a Fairseq fork for sequence tagging/labeling tasksβ32Jun 7, 2020Updated 5 years ago
- An Easy Annotation Tool for Natural Language Processingβ11May 17, 2024Updated last year
- Unofficial implementation of Adaptive Input in PyTorchβ12Feb 22, 2019Updated 7 years ago
- Official implementation of a temporal pupil light response model proposed in the Scientific Reports article: "Deep learning-based pupil mβ¦β11Jan 6, 2023Updated 3 years ago
- Neural text-to-text question generationβ216Nov 13, 2020Updated 5 years ago
- β209Nov 12, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.β201May 26, 2024Updated last year
- UDP Traffic Obfuscatorβ10Jun 18, 2014Updated 11 years ago
- sequence tagging for NER for ULMFiTβ20Nov 4, 2020Updated 5 years ago
- code for Question Condensing Networks for Answer Selection in Community Question Answeringβ14Aug 26, 2018Updated 7 years ago
- Linear chain conditional random fields are implemented using Numpy and Mxnet/Gluon, and batch training is supported, not limited to trainβ¦β22Apr 5, 2019Updated 7 years ago
- Bulk Copyscape is a script that utilizes Copyscape's API to by-pass the normal bulk upload queue, allowing you to quickly check websites β¦β17Nov 13, 2022Updated 3 years ago
- TransformerDBβ19Apr 22, 2021Updated 5 years ago