Compound splitter for German
☆113 · Apr 5, 2020 · Updated 6 years ago
Alternatives and similar repositories for CharSplit
Users who are interested in CharSplit are comparing it to the libraries listed below.
- An unsupervised compound splitter ☆42 · Oct 6, 2019 · Updated 6 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri… ☆35 · Jul 7, 2022 · Updated 3 years ago
- A compound word splitter for Python ☆49 · Aug 18, 2021 · Updated 4 years ago
- A lemmatizer for German language text ☆94 · Feb 7, 2023 · Updated 3 years ago
- Automatic Detection of Potentially Idiomatic Expressions ☆12 · Feb 19, 2021 · Updated 5 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings. ☆16 · Mar 15, 2017 · Updated 9 years ago
- ☆12 · Jan 27, 2026 · Updated 2 months ago
- German Morphological Analyzer ☆52 · Nov 12, 2021 · Updated 4 years ago
- A small package for handy conversion of German numerals (also ordinal / signed) written as words to numbers. ☆12 · Jan 22, 2026 · Updated 2 months ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆153 · Dec 9, 2024 · Updated last year
- An easy-to-use API for analyzing INCEpTION annotation projects. ☆17 · Oct 17, 2023 · Updated 2 years ago
- Open German WordNet ☆100 · Jan 7, 2026 · Updated 3 months ago
- Using DSPy and LLMs to translate Sanskrit verses ☆18 · Jun 22, 2024 · Updated last year
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆518 · Oct 30, 2024 · Updated last year
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021] ☆14 · Mar 29, 2021 · Updated 5 years ago
- Source code and data for the paper "Towards String-to-Tree Neural Machine Translation" ☆16 · Dec 31, 2017 · Updated 8 years ago
- Java persistence with RDF ☆11 · Oct 1, 2024 · Updated last year
- Ukrainian ELECTRA model ☆12 · Mar 11, 2023 · Updated 3 years ago
- ☆13 · Updated this week
- Coding utilities for quantitative legal studies ☆14 · Dec 7, 2025 · Updated 4 months ago
- GermaParl R Data Package ☆14 · Aug 31, 2022 · Updated 3 years ago
- Python client to the INCEpTION annotation tool ☆17 · Jun 10, 2025 · Updated 10 months ago
- Any contributions to the NLTK project ☆29 · May 8, 2014 · Updated 11 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso… ☆242 · Mar 19, 2026 · Updated 3 weeks ago
- Code for keyphrase classification systems submitted to the SemEval 2017 shared task ScienceIE. ☆36 · Jun 12, 2018 · Updated 7 years ago
- German Language Understanding Evaluation Benchmark @NAACL24 ☆22 · Dec 11, 2025 · Updated 4 months ago
- GermaNER: Free Open German Named Entity Recognition Tool ☆36 · Dec 16, 2023 · Updated 2 years ago
- ☆16 · Nov 21, 2022 · Updated 3 years ago
- A small Python library to parse and write TSV files generated by the WebAnno software. ☆11 · Apr 14, 2025 · Updated 11 months ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input", Lee et al., 2015 TACL ☆10 · Aug 11, 2016 · Updated 9 years ago
- Automatically exported from code.google.com/p/relation-extraction-corpus ☆57 · Dec 14, 2015 · Updated 10 years ago
- Occupation Coding refers to coding short verbal texts (survey answers) into an occupational classification. This package implements sever… ☆14 · Mar 11, 2024 · Updated 2 years ago
- A Python wrapper for the multilingual temporal tagger HeidelTime. ☆26 · Mar 21, 2022 · Updated 4 years ago
- Running LayoutLMv2 ☆11 · Apr 27, 2022 · Updated 3 years ago
- Zunda: Japanese Enhanced Modality Analyzer client for Python. ☆10 · Nov 30, 2019 · Updated 6 years ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures. ☆33 · May 13, 2024 · Updated last year
- Nagios Checks ☆11 · Jan 9, 2024 · Updated 2 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task. ☆28 · Apr 5, 2017 · Updated 9 years ago
- A localized word dictionary asset for University of Tsukuba ☆12 · Sep 19, 2025 · Updated 6 months ago