TimKam / compound-word-splitterLinks
A compound word splitter for Python
☆48Updated 3 years ago
Alternatives and similar repositories for compound-word-splitter
Users that are interested in compound-word-splitter are comparing it to the libraries listed below
Sorting:
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Extract dates from text☆64Updated 4 years ago
- Language detection extension for spaCy 2.0+☆113Updated 6 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 10 months ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- ☆70Updated 2 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 7 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 11 months ago
- spaCy + UDPipe☆161Updated 3 years ago
- Running Prodigy for a team of annotators☆53Updated 4 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- Language Models for Zalando's flair library☆61Updated 5 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated 2 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆115Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- A small tool that EXPLains spACY parse results. See what I did there?☆84Updated 3 years ago
- Finds linguistic patterns effortlessly☆36Updated last year
- Python library for Natural Language Preprocessing (NLPre)☆191Updated last year
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆112Updated 4 months ago
- Various utilities for processing the data.☆209Updated this week