nytud / emtsv
e-magyar text processing system -- inter-module communication via tsv + REST API
☆29Updated last month
Alternatives and similar repositories for emtsv
Users that are interested in emtsv are comparing it to the libraries listed below
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆15Updated last year
- PurePos is an open source hybrid morphological tagger.☆16Updated 4 years ago
- A NoSketch Engine Docker image which is easy to use☆19Updated 7 months ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆65Updated last week
- A simple configurable tool for manipulating dependency trees.☆13Updated 5 months ago
- The NLG tool for Finnish☆23Updated last year
- Yet another search platform for linguistic corpora.☆25Updated 2 weeks ago
- An NLP pipeline for Hebrew☆38Updated this week
- Python Finite-State Toolkit☆55Updated last week
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆30Updated 3 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆169Updated 7 months ago
- LingPy: Python library for quantitative tasks in historical linguistics☆134Updated 2 months ago
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆80Updated 2 weeks ago
- A character-wise tokenizer for morphologically rich languages☆27Updated 3 months ago
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆15Updated last year
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆113Updated last year
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated 3 weeks ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆48Updated last year
- A curated list of NLP resources for Hungarian☆247Updated last month
- A Python library for easily querying morphological inflection models trained on Unimorph☆13Updated 2 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- Experimental Finnish language model for SpaCy☆41Updated 6 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 11 months ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Full Stack of Latvian Language Resources for Natural Language Understanding (NLU) and Generation (NLG)☆15Updated 2 years ago
- Morphological analyzer and lemmatizer for Latin.☆27Updated 4 months ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated last year