nytud / emtsv
e-magyar text processing system -- inter-module communication via tsv + REST API
☆29Updated last month
Alternatives and similar repositories for emtsv
Users that are interested in emtsv are comparing it to the libraries listed below
Sorting:
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆15Updated last year
- PurePos is an open source hybrid morphological tagger.☆16Updated 4 years ago
- This is an open-source sentiment analysis tool for Hungarian language, written in Python.☆11Updated 8 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆167Updated 6 months ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- Hungarian tokenizer.☆14Updated 3 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆33Updated 2 weeks ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated last year
- A curated list of NLP resources for Hungarian☆247Updated last month
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- A tool for automatic spelling normalization☆20Updated 4 years ago
- In-browser OCR of Ancient Greek and Latin☆26Updated 3 weeks ago
- Tools for compiling corpora from Common Crawl☆14Updated 5 months ago
- ☆17Updated 2 months ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- A python library for easily querying morphological inflection models trained on Unimorph☆13Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆22Updated 3 years ago
- A simple configurable tool for manipulating dependency trees.☆13Updated 4 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆158Updated this week
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated 3 years ago
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, …☆17Updated 10 months ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated last year
- ☆11Updated 3 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆51Updated 2 years ago
- GermaNet API for Python☆53Updated 7 years ago
- UIMA CAS processing library written in Python☆88Updated last month
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- A NoSketch Engine Docker image which is easy to use☆19Updated 6 months ago
- Python 3 library for processing historical English☆67Updated 9 months ago