notAI-tech / deepsegment
A sentence segmenter that actually works!
☆306 Updated 4 years ago
Alternatives and similar repositories for deepsegment:
Users who are interested in deepsegment are comparing it to the libraries listed below.
- Punctuation restoration and spell correction experiments. ☆252 Updated 4 years ago
- Text and Punctuation correction with Deep Learning ☆128 Updated 5 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆202 Updated 3 years ago
- Language independent truecaser in Python. ☆160 Updated 3 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology… ☆223 Updated 2 years ago
- xfspell — the Transformer Spell Checker ☆190 Updated 4 years ago
- ✔️ Contextual word checker for better suggestions (not actively maintained) ☆413 Updated 3 months ago
- ☆72 Updated 6 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary. ☆316 Updated 2 months ago
- LASER multilingual sentence embeddings as a pip package ☆223 Updated last year
- (yet another not really) awesome topic/text segmentation list ☆108 Updated 6 years ago
- Named Entity Recognition based on dictionaries ☆242 Updated 6 years ago
- A python module for English lemmatization and inflection. ☆268 Updated last year
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation) ☆437 Updated 2 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018) ☆158 Updated 5 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 Updated 2 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆731 Updated 8 months ago
- A python true casing utility that restores case information for texts ☆88 Updated 2 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT ☆229 Updated 4 years ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided ☆187 Updated last year
- Text2Text Language Modeling Toolkit ☆300 Updated 3 months ago
- A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunk… ☆234 Updated 6 years ago
- Python library for Natural Language Preprocessing (NLPre) ☆191 Updated last year
- Unsupervised Statistical Machine Translation ☆229 Updated 4 years ago
- Google USE (Universal Sentence Encoder) for spaCy ☆184 Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆360 Updated last year
- Switchboard Dialog Act Corpus with Penn Treebank links ☆144 Updated 4 years ago
- Experimental project to punctuate text using an embedding layer, single convolutional layer and output softmax layer written in Keras. ☆83 Updated 4 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an… ☆557 Updated 3 years ago
- Segment documents into coherent parts using word embeddings. ☆149 Updated 3 years ago