rsennrich / clevertagger
morphologically informed POS tagging for German
☆25Updated 3 years ago
Related projects: ⓘ
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆11Updated last year
- Open-source tools for morphological tagging, segmentation and stemming.☆41Updated 5 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 7 years ago
- Automatically exported from code.google.com/p/hunpos☆11Updated 6 years ago
- Fast Word Clustering Software☆74Updated last month
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆16Updated last week
- GermaNet API for Python☆53Updated 6 years ago
- A tool for automatic spelling normalization☆20Updated 3 years ago
- The Zurich Dependency Parser for German☆81Updated 2 years ago
- A tool for text normalisation via character-level machine translation☆13Updated 4 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆180Updated 3 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated 9 months ago
- Fast and robust NLP components implemented in Java.☆52Updated 3 years ago
- Parsito: Fast non-projective transition-based dependency parser☆14Updated last year
- CRF-based Morphological Tagging and Lemmatization☆34Updated 4 years ago
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆65Updated last week
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆68Updated 3 weeks ago
- Automatically exported from code.google.com/p/deepsyntacticparsing☆23Updated 9 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆60Updated 4 months ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆123Updated 10 months ago
- ☆12Updated this week
- Extension of the mate-tools NLP pipeline☆66Updated 8 years ago
- ☆95Updated 3 years ago
- small Java library for splitting German compound words☆62Updated 4 months ago
- Thot toolkit for statistical machine translation☆50Updated last year
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆64Updated 2 years ago
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆23Updated last year
- German Morphological Analyzer☆45Updated 2 years ago
- Machine translation for the real world☆23Updated 4 years ago