nytud / emtsv
e-magyar text processing system -- inter-module communication via tsv + REST API
☆27Updated 9 months ago
Related projects: ⓘ
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆14Updated last year
- PurePos is an open source hybrid morphological tagger.☆15Updated 3 years ago
- Python Finite-State Toolkit☆39Updated last month
- A tool for automatic spelling normalization☆20Updated 3 years ago
- A python library for easily querying morphological inflection models trained on Unimorph☆11Updated last year
- Featurize words into orthographic and phonological vectors.☆39Updated last year
- ☆16Updated 5 months ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆42Updated last year
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆53Updated this week
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated last year
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆28Updated 2 years ago
- The curation repository for the data behind Concepticon.☆32Updated this week
- Python framework for processing Universal Dependencies data☆55Updated last week
- The Open Multilingual Wordnet☆58Updated 4 months ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆60Updated this week
- Tools for compiling corpora from Common Crawl☆12Updated 3 weeks ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆60Updated 4 months ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 3 months ago
- Small-vocabulary sequence-to-sequence generation with optional feature conditioning☆29Updated last week
- now you can even use apertium from python☆31Updated 7 months ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- This packages up data for the Open Multilingual Wordnet☆42Updated last year
- The Unicode Cookbook for Linguists☆53Updated 3 years ago
- GermaNet API for Python☆53Updated 6 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated 10 months ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆70Updated last month
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Open morphology for Finnish☆84Updated last month