Compound splitter for German
☆112Apr 5, 2020Updated 6 years ago
Alternatives and similar repositories for CharSplit
Users that are interested in CharSplit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An unsupervised compound splitter☆41Oct 6, 2019Updated 6 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆35Jul 7, 2022Updated 3 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Oct 26, 2022Updated 3 years ago
- A lemmatizer for German language text☆95Feb 7, 2023Updated 3 years ago
- Automatic Detection of Potentially Idiomatic Expressions☆12Feb 19, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Mar 15, 2017Updated 9 years ago
- ☆12Jan 27, 2026Updated 4 months ago
- German Morphological Analyzer☆54Nov 12, 2021Updated 4 years ago
- The Zurich Dependency Parser for German☆89Aug 27, 2025Updated 9 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆152Dec 9, 2024Updated last year
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Oct 17, 2023Updated 2 years ago
- Language models are open knowledge graphs ( non official implementation )☆13Jan 17, 2021Updated 5 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- ☆20Apr 26, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The list of Ukrainian words for sentiment analysis and NLP☆15Sep 5, 2021Updated 4 years ago
- A collection of utilities for handling IPA phones.☆27Sep 24, 2023Updated 2 years ago
- Morphological Dictionaries for German Language☆32Apr 29, 2026Updated last month
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆524Oct 30, 2024Updated last year
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Mar 29, 2021Updated 5 years ago
- Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"☆16Dec 31, 2017Updated 8 years ago
- Java persistence with RDF☆11Oct 1, 2024Updated last year
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- ☆13May 29, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- German part-of-speech dictionary☆47Sep 6, 2023Updated 2 years ago
- The Paradise Papers dataset and guide from the International Consortium of Investigative Journalists (ICIJ)☆11Oct 25, 2024Updated last year
- GermaParl R Data Package☆14Aug 31, 2022Updated 3 years ago
- Python client to the INCEpTION annotation tool☆17Jun 10, 2025Updated last year
- small Java library for splitting German compound words☆66May 13, 2024Updated 2 years ago
- ☆31Apr 21, 2023Updated 3 years ago
- Code for keyphrase classification systems submitted to the SemEval 2017 shared task ScienceIE.☆36Jun 12, 2018Updated 8 years ago
- ☆16Nov 21, 2022Updated 3 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Automatically exported from code.google.com/p/relation-extraction-corpus☆57Dec 14, 2015Updated 10 years ago
- Occupation Coding refers to coding short verbal texts (survey answers) into an occupational classification. This package implements sever…☆14Mar 11, 2024Updated 2 years ago
- natural language processing on german texts☆16Mar 20, 2018Updated 8 years ago
- Zunda: Japanese Enhanced Modality Analyzer client for Python.☆10Nov 30, 2019Updated 6 years ago
- Netzwerklisten der Twitterbots euroedit, bundesedit,politikedits und landesedit☆31May 6, 2024Updated 2 years ago
- Unofficial CLI tool for Microsoft Azure Speech service management - datasets, models, tests, endpoints etc. Useful for automation.☆10Jun 8, 2020Updated 6 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆27Apr 5, 2017Updated 9 years ago