nytud / emtsv
e-magyar text processing system -- inter-module communication via tsv + REST API
☆27 Updated 11 months ago
Related projects
Alternatives and complementary repositories for emtsv
- PurePos is an open source hybrid morphological tagger. ☆15 Updated 4 years ago
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens. ☆14 Updated last year
- ☆17 Updated last month
- A part-of-speech tagger with support for domain adaptation and external resources. ☆22 Updated 2 years ago
- Python Finite-State Toolkit ☆45 Updated last week
- A Python library for easily querying morphological inflection models trained on UniMorph ☆12 Updated 2 years ago
- Custom French POS and lemmatizer based on Lefff for spaCy ☆64 Updated last year
- This is an open-source sentiment analysis tool for the Hungarian language, written in Python. ☆11 Updated 8 years ago
- Hungarian tokenizer. ☆14 Updated 2 years ago
- The Unicode Cookbook for Linguists ☆53 Updated 4 years ago
- Featurize words into orthographic and phonological vectors. ☆40 Updated last year
- A tool for automatic spelling normalization ☆20 Updated 3 years ago
- Small-vocabulary sequence-to-sequence generation with optional feature conditioning ☆31 Updated this week
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆12 Updated last year
- Python framework for processing Universal Dependencies data ☆57 Updated this week
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, … ☆18 Updated 4 months ago
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages. ☆22 Updated last year
- A character-wise tokenizer for morphologically rich languages ☆27 Updated 5 months ago
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be … ☆15 Updated 8 months ago
- An NLP pipeline for Hebrew ☆34 Updated 7 months ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp… ☆29 Updated 3 years ago
- Source code for the Apple reproduction ☆31 Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated last year
- The NLG tool for Finnish ☆22 Updated 11 months ago
- Generic Environment for Context-Aware Correction of Orthography ☆22 Updated 2 years ago
- Language detection using spaCy and fastText ☆54 Updated 11 months ago
- The curation repository for the data behind Concepticon. ☆34 Updated this week
- Parser for KAF NAF files written in Python ☆16 Updated 3 years ago
- Software for detecting text reuse with BLAST. ☆14 Updated 5 years ago
- Open source AI benchmarking toolkit for speech-to-text services ☆54 Updated 7 months ago