nytud / emtsvLinks
e-magyar text processing system -- inter-module communication via tsv + REST API
☆29Updated last month
Alternatives and similar repositories for emtsv
Users that are interested in emtsv are comparing it to the libraries listed below
Sorting:
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆16Updated 2 years ago
- Various utilities for processing the data.☆213Updated this week
- ☆18Updated 4 months ago
- German Morphological Analyzer☆47Updated 3 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆172Updated 2 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆255Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆176Updated 4 months ago
- A curated list of NLP resources for Hungarian☆257Updated 2 months ago
- Bilingual sentence similarity classifier using Tensorflow☆24Updated 6 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆147Updated 10 months ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆114Updated last year
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, …☆20Updated last year
- An Easy Annotation Tool for Natural Language Processing☆11Updated last year
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆57Updated 2 months ago
- This packages up data for the Open Multilingual Wordnet☆55Updated 4 months ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- A modern, interlingual wordnet interface for Python☆265Updated last month
- Universal Dependencies online documentation☆288Updated this week
- LingPy: Python library for quantitative tasks in historical linguistics☆137Updated 2 months ago
- Text tokenization and sentence segmentation (segtok v2)☆206Updated 3 years ago
- A multilingual parallel corpus created from translations of the Bible.☆189Updated 5 months ago
- ☆49Updated last year
- Compound splitter for German☆108Updated 5 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- Bitextor generates translation memories from multilingual websites☆296Updated 11 months ago
- A python library for easily querying morphological inflection models trained on Unimorph☆13Updated 2 years ago
- The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.☆114Updated last year
- Python Finite-State Toolkit☆58Updated this week
- A part-of-speech tagger with support for domain adaptation and external resources.☆23Updated 2 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year