notAI-tech / deepsegment
A sentence segmenter that actually works!
☆303Updated 4 years ago
Alternatives and similar repositories for deepsegment:
Users that are interested in deepsegment are comparing it to the libraries listed below
- Punctuation restoration and spell correction experiments.☆250Updated 3 years ago
- Text and Punctuation correction with Deep Learning☆129Updated 4 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆221Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)☆203Updated 2 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- xfspell — the Transformer Spell Checker☆188Updated 4 years ago
- Language independent truecaser in Python.☆161Updated 3 years ago
- Segment documents into coherent parts using word embeddings.☆148Updated 2 years ago
- A python module for English lemmatization and inflection.☆265Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆353Updated last year
- Unsupervised Statistical Machine Translation☆229Updated 4 years ago
- (yet another not really) awesome topic/text segmentation list☆107Updated 6 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆180Updated last year
- BERT fine-tuning for POS tagging task (Keras)☆75Updated 5 years ago
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.☆83Updated 4 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆311Updated this week
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)☆158Updated 5 years ago
- JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation☆112Updated last year
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆440Updated 9 months ago
- Bitextor generates translation memories from multilingual websites☆293Updated 2 months ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆83Updated 5 years ago
- Python port of Moses tokenizer, truecaser and normalizer☆490Updated 7 months ago
- This repository contains various ways to calculate sentence vector similarity using NLP models☆200Updated 4 years ago
- Unsupervised Question answering via Cloze Translation☆219Updated 2 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆227Updated last year
- ☆73Updated 6 years ago
- Text2Text Language Modeling Toolkit☆292Updated this week
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆432Updated last year
- Fast, DB Backed pretrained word embeddings for natural language processing.☆223Updated last year