ian-beaver / pycontractions
Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.
☆75Updated 3 years ago
Alternatives and similar repositories for pycontractions:
Users that are interested in pycontractions are comparing it to the libraries listed below
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- Language Models for Zalando's flair library☆61Updated 5 years ago
- Character-based word embeddings model based on RNN for handling real world texts☆173Updated last year
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆62Updated last year
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 6 months ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Running Prodigy for a team of annotators☆53Updated 4 years ago
- spaCy + UDPipe☆160Updated 2 years ago
- Python library for Natural Language Preprocessing (NLPre)☆190Updated last year
- A compound word splitter for Python☆48Updated 3 years ago
- shabeelkandi / Handling-Out-of-Vocabulary-Words-in-Natural-Language-Processing-using-Language-Modelling☆68Updated 5 years ago
- The weights for the embedding layer of Scandinavian UMLFiT language models☆32Updated 5 years ago
- 📂 Additional lookup tables and data resources for spaCy☆100Updated 2 weeks ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆79Updated 7 months ago
- Word Embeddings for Information Retrieval☆225Updated last year
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated last year
- NLP French language model implementing ULMFiT☆87Updated 5 years ago
- Wrapper to use syntaxnet with pre-trained model☆29Updated 6 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 6 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 4 years ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Storage and retrieval of Word Embeddings in various databases☆51Updated 6 years ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago