Compound splitter for German
☆112Apr 5, 2020Updated 5 years ago
Alternatives and similar repositories for CharSplit
Users that are interested in CharSplit are comparing it to the libraries listed below
Sorting:
- An unsupervised compound splitter☆42Oct 6, 2019Updated 6 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆35Jul 7, 2022Updated 3 years ago
- Automatic Detection of Potentially Idiomatic Expressions☆12Feb 19, 2021Updated 5 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Mar 15, 2017Updated 8 years ago
- A compound word splitter for Python☆49Aug 18, 2021Updated 4 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Oct 26, 2022Updated 3 years ago
- ☆11Jan 27, 2026Updated last month
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- The Zurich Dependency Parser for German☆89Aug 27, 2025Updated 6 months ago
- Code for "A Dependency Syntactic Knowledge Augmented Interactive Architecture for End-to-End Aspect-based Sentiment Analysis" on Neurocom…☆17May 19, 2021Updated 4 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Dec 16, 2023Updated 2 years ago
- ☆20Apr 26, 2017Updated 8 years ago
- ☆28Aug 4, 2015Updated 10 years ago
- German part-of-speech dictionary☆46Sep 6, 2023Updated 2 years ago
- Dalphi - Active Learning Platform for Human Interaction☆23Aug 20, 2018Updated 7 years ago
- Automatically exported from code.google.com/p/relation-extraction-corpus☆57Dec 14, 2015Updated 10 years ago
- A python wrapper for the multilingual temporal tagger HeidelTime.☆26Mar 21, 2022Updated 3 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆28Apr 5, 2017Updated 8 years ago
- Open German WordNet☆100Jan 7, 2026Updated last month
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆242Aug 21, 2024Updated last year
- A web application tagging and retrieval of arguments in text☆30May 1, 2023Updated 2 years ago
- Simple perceptron tagger trained using the NLTK on the NLCOW14 corpus.☆25Mar 20, 2018Updated 7 years ago
- Any contributions to the NLTK project☆29May 8, 2014Updated 11 years ago
- small Java library for splitting German compound words☆63May 13, 2024Updated last year
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆33May 13, 2024Updated last year
- A Java implementation of the Rapid Automatic Keyword Extraction Framework ( RAKE )☆29Feb 8, 2018Updated 8 years ago
- Morphological Dictionaries for German Language☆30Apr 6, 2018Updated 7 years ago
- Writing Observer and Learning Observer: A system for monitoring learning process data, with an initial focus on writing process data from…☆12Updated this week
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆29Sep 28, 2018Updated 7 years ago
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text.☆58Feb 2, 2018Updated 8 years ago
- Wrapper to use syntaxnet with pre-trained model☆29Jun 24, 2018Updated 7 years ago
- Common web archive utility code.☆61Feb 6, 2026Updated 3 weeks ago
- Using the function read.table() to break file into chunks to loop and process them. This allows processing files of any size beyond what …☆10Aug 19, 2014Updated 11 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- Example to read qr code with kotlin☆10Jul 24, 2018Updated 7 years ago
- ☆39May 31, 2017Updated 8 years ago
- ☆37Nov 16, 2017Updated 8 years ago
- PwnHub is a CTF collaboration platform written in Bash, originally built as a response to a joke about the Bash Stack by yousuckatprogram…☆20Jun 13, 2025Updated 8 months ago