MSeal / cython_hunspell
Cython wrapper on Hunspell Dictionary
★66, updated 7 months ago
Alternatives and similar repositories for cython_hunspell:
Users who are interested in cython_hunspell are comparing it to the libraries listed below:
- Hunspell extension for spaCy 2.0. (★94, updated 6 months ago)
- Additional lookup tables and data resources for spaCy (★100, updated 2 weeks ago)
- A python module for word inflections designed for use with spaCy. (★92, updated 5 years ago)
- The Open Multilingual Wordnet (★61, updated 9 months ago)
- Text tokenization and sentence segmentation (segtok v2) (★202, updated 2 years ago)
- Transform TMX to text (★28, updated 2 years ago)
- Automatic extraction of edited sentences from text edition histories. (★82, updated 3 years ago)
- A minimal, pure Python library to interface with CoNLL-U format files. (★148, updated last year)
- A fully customisable language detection pipeline for spaCy (★92, updated 5 years ago)
- Language independent truecaser in Python. (★160, updated 3 years ago)
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files (★27, updated 5 years ago)
- CONLL-U to Pandas DataFrame (★31, updated 7 years ago)
- Python framework for processing Universal Dependencies data (★55, updated last week)
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… (★170, updated 3 years ago)
- Tools for extracting parallel corpora from article titles across languages in Wikipedia (★72, updated 9 years ago)
- spaCy + UDPipe (★160, updated 2 years ago)
- Various utilities for processing the data. (★207, updated this week)
- Tool for parsing and converting various span encoding schemes. (★22, updated last year)
- German lemmatization with IWNLP as extension for spaCy (★24, updated last year)
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at… (★22, updated 6 months ago)
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… (★79, updated 7 months ago)
- OpusFilter - Parallel corpus processing toolkit (★104, updated 2 weeks ago)
- Multi Tier Annotation Search (★26, updated 3 years ago)
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni… (★12, updated last year)
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine (★186, updated 4 years ago)
- 🧪 Cutting-edge experimental spaCy components and features (★96, updated 9 months ago)
- Tool to fix bitexts and tag near-duplicates for removal (★29, updated last week)
- German Morphological Analyzer (★47, updated 3 years ago)
- Cython wrapper on Hunspell Dictionary (★23, updated last year)
- Alignment and annotation for comparable documents. (★22, updated 6 years ago)