rsennrich / clevertagger
morphologically informed POS tagging for German
☆25Updated 3 years ago
Alternatives and similar repositories for clevertagger:
Users that are interested in clevertagger are comparing it to the libraries listed below
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated last month
- A tool for automatic spelling normalization☆20Updated 4 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆41Updated 5 years ago
- CRF-based Morphological Tagging and Lemmatization☆36Updated 5 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Updated 8 years ago
- Thot toolkit for statistical machine translation☆53Updated 2 years ago
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆24Updated 2 months ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated 2 weeks ago
- Fast Word Clustering Software☆78Updated last month
- A tool for text normalisation via character-level machine translation☆13Updated 4 years ago
- The Zurich Dependency Parser for German☆83Updated 2 years ago
- Hierarchical phrase-based machine translation system☆32Updated 10 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated last week
- Automatically exported from code.google.com/p/hunpos☆12Updated 6 years ago
- Parsito: Fast non-projective transition-based dependency parser☆14Updated 2 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated last year
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated 10 months ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆126Updated 3 months ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆41Updated 4 months ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 8 years ago
- Program used to split text into segments☆25Updated 5 months ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- Code for the paper Faster Phrase-Based Decoding by Refining Feature State☆14Updated 2 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 7 years ago
- The Non-Official Characterization (NOC) List is a knowledge-base containing semantic triples about famous people, living and dead, fictio…☆24Updated 6 years ago