Compound splitter for German
☆112Apr 5, 2020Updated 6 years ago
Alternatives and similar repositories for CharSplit
Users that are interested in CharSplit are comparing it to the libraries listed below.
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Oct 26, 2022Updated 3 years ago
- A lemmatizer for German language text☆95Feb 7, 2023Updated 3 years ago
- Automatic Detection of Potentially Idiomatic Expressions☆12Feb 19, 2021Updated 5 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Mar 15, 2017Updated 9 years ago
- German Morphological Analyzer☆54Nov 12, 2021Updated 4 years ago
- GermaNet API for Python☆54Mar 8, 2018Updated 8 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆152Dec 9, 2024Updated last year
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Oct 17, 2023Updated 2 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- ☆20Apr 26, 2017Updated 9 years ago
- The list of Ukrainian words for sentiment analysis and NLP☆15Sep 5, 2021Updated 4 years ago
- A collection of utilities for handling IPA phones.☆27Sep 24, 2023Updated 2 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆526Oct 30, 2024Updated last year
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Mar 29, 2021Updated 5 years ago
- ☆28Aug 4, 2015Updated 10 years ago
- Some examples on computing MLEs using TensorFlow☆15Sep 8, 2017Updated 8 years ago
- Java persistence with RDF☆11Oct 1, 2024Updated last year
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- Coding utilities for quantitative legal studies☆14Dec 7, 2025Updated 6 months ago
- Any contributions to the NLTK project☆29May 8, 2014Updated 12 years ago
- GermaParl R Data Package☆14Aug 31, 2022Updated 3 years ago
- Python client to the INCEpTION annotation tool☆17Jun 10, 2025Updated last year
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆242Jun 11, 2026Updated 3 weeks ago
- small Java library for splitting German compound words☆66May 13, 2024Updated 2 years ago
- Code for "A Dependency Syntactic Knowledge Augmented Interactive Architecture for End-to-End Aspect-based Sentiment Analysis" on Neurocom…☆17May 19, 2021Updated 5 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆38Dec 16, 2023Updated 2 years ago
- A small python library to parse and write TSV files generated by the WebAnno software.☆11Apr 14, 2025Updated last year
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- natural language processing on german texts☆16Mar 20, 2018Updated 8 years ago
- A python wrapper for the multilingual temporal tagger HeidelTime.☆26Mar 21, 2022Updated 4 years ago
- running LayoutLMv2☆11Apr 27, 2022Updated 4 years ago
- Network lists of the Twitter bots euroedit, bundesedit, politikedits and landesedit☆31May 6, 2024Updated 2 years ago
- Data files of German Decompounder for Apache Lucene / Apache Solr / Elasticsearch☆112Sep 13, 2021Updated 4 years ago
- Robust Orthonormal Subspace Learning in Python☆15Jun 1, 2020Updated 6 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆27Apr 5, 2017Updated 9 years ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated 4 months ago
- A localized word dictionary asset for University of Tsukuba☆12Sep 19, 2025Updated 9 months ago
- suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- ☆11Dec 30, 2017Updated 8 years ago