ian-beaver / pycontractions
Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.
โ77Updated 3 years ago
Alternatives and similar repositories for pycontractions:
Users that are interested in pycontractions are comparing it to the libraries listed below
- Language detection extension for spaCy 2.0+โ112Updated 6 years ago
- ๐Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wiโฆโ63Updated last year
- A fully customisable language detection pipeline for spaCyโ92Updated 6 years ago
- Google USE (Universal Sentence Encoder) for spaCyโ184Updated 2 years ago
- Language Models for Zalando's flair libraryโ61Updated 5 years ago
- NLP French language model implementing ULMFiTโ87Updated 6 years ago
- Hunspell extension for spaCy 2.0.โ94Updated 9 months ago
- shabeelkandi / Handling-Out-of-Vocabulary-Words-in-Natural-Language-Processing-using-Language-Modellingโ69Updated 5 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.โ71Updated 2 years ago
- spaCy + UDPipeโ161Updated 3 years ago
- Sentence transformers models for SpaCyโ107Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)โ202Updated 3 years ago
- Python library for Natural Language Preprocessing (NLPre)โ191Updated last year
- The weights for the embedding layer of Scandinavian UMLFiT language modelsโ32Updated 5 years ago
- Python wrapper for wit.ai's Duckling Clojure libraryโ131Updated 3 years ago
- โ72Updated 6 years ago
- Character-based word embeddings model based on RNN for handling real worldย textsโ174Updated last year
- Fixes contractions such as `you're` to `you are`โ317Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddingsโ88Updated 4 years ago
- ๐ Emoji handling and meta data for spaCy with custom extension attributesโ181Updated last year
- Create interactive textual heat maps for Jupiter notebooksโ196Updated 11 months ago
- A compound word splitter for Pythonโ48Updated 3 years ago
- A python module for word inflections designed for use with spaCy.โ92Updated 5 years ago
- Inter-annotator agreement for Doccanoโ27Updated 5 years ago
- A spell checker built from GloVe word vectorsโ81Updated 6 years ago
- LASER multilingual sentence embeddings as a pip packageโ223Updated last year
- Implementation of GloVe in Kerasโ45Updated 2 years ago
- Language independent truecaser in Python.โ160Updated 3 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic feโฆโ169Updated 3 years ago
- Use ML-Annotate to label data for machine learning purposesโ109Updated 4 years ago