danielnaber / jwordsplitter
small Java library for splitting German compound words
☆62Updated 4 months ago
Related projects: ⓘ
- ☆28Updated 9 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated 9 months ago
- An unsupervised compound splitter☆40Updated 4 years ago
- Program used to split text into segments☆25Updated last year
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- A Utility Library for Wikipedia dumps☆33Updated 7 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 7 years ago
- Extension of the mate-tools NLP pipeline☆66Updated 8 years ago
- Machine translation for the real world☆23Updated 4 years ago
- A Java Wikipedia markup to plain text converter☆37Updated 2 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆41Updated 5 years ago
- German Morphological Analyzer☆45Updated 2 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆16Updated last week
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆23Updated last year
- morphologically informed POS tagging for German☆25Updated 3 years ago
- TreeTagger for Java☆16Updated 2 years ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆68Updated 3 weeks ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆11Updated last year
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆49Updated 4 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- Thot toolkit for statistical machine translation☆50Updated last year
- IXA pipes Named Entity Tagger (http://ixa2.si.ehu.es/ixa-pipes).☆31Updated 5 years ago
- Yara K-Beam Arc-Eager Dependency Parser☆55Updated 8 years ago
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab.☆34Updated last year
- Software and resources for natural language processing.☆130Updated 8 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Automatically exported from code.google.com/p/deepsyntacticparsing☆23Updated 9 years ago
- Solr Query Segmenter for structuring unstructured queries☆21Updated 3 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 3 years ago