Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern string search
☆35Jul 7, 2022Updated 3 years ago
Alternatives and similar repositories for german_compound_splitter
Users that are interested in german_compound_splitter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Compound splitter for German☆113Apr 5, 2020Updated 6 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆13Nov 29, 2022Updated 3 years ago
- Alignment and annotation for comparable documents.☆22Oct 16, 2018Updated 7 years ago
- ☆12Jan 27, 2026Updated 3 months ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆38Jan 16, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- Extension for pie to include taggers with their models and pre/postprocessors☆11May 30, 2024Updated last year
- IPA Phonetic dataset lexicon☆19Apr 18, 2026Updated 2 weeks ago
- German Language Understanding Evaluation Benchmark @NAACL24☆23Dec 11, 2025Updated 4 months ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- Legal Reference Extraction☆46Apr 22, 2026Updated 2 weeks ago
- A lemmatizer for German language text☆95Feb 7, 2023Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- Coqui Inference Engine☆41Aug 3, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆521Oct 30, 2024Updated last year
- The source code for the TIRA Shared Task Platform☆17Updated this week
- Awesome stuff made by the Mycroft community☆13Sep 16, 2021Updated 4 years ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆20Aug 28, 2023Updated 2 years ago
- Wrapper for the yr.no weather service API.☆15Apr 12, 2018Updated 8 years ago
- Automatic Limerick Generation☆11Mar 18, 2021Updated 5 years ago
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- Poems retrieval demo built with GNES framework☆14Oct 3, 2019Updated 6 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is a project for visualizing word embeddings based on the work of Andrei Kashcha (@anvaka).☆21Mar 29, 2019Updated 7 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Oct 27, 2022Updated 3 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- text-to-speech alignment java software☆20Aug 25, 2019Updated 6 years ago
- ⚙️ Das Backend zu OffeneGesetze.de☆25Jan 11, 2024Updated 2 years ago
- Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.☆14Apr 3, 2021Updated 5 years ago
- ☆71Oct 29, 2021Updated 4 years ago
- A rolling version of the Latent Dirichlet Allocation.☆13Nov 27, 2023Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Nov 25, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code to create the dataset from "A New Aligned Simple German Corpus☆11Jan 8, 2024Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Aug 6, 2019Updated 6 years ago
- X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022)☆14Jul 22, 2022Updated 3 years ago
- Home surveillance system with facial recognition☆17Jun 10, 2020Updated 5 years ago
- A very tiny python api for the stock exchange tradegate.de☆16Jan 20, 2022Updated 4 years ago
- Code supporting the paper Graph-Embedding Empowered Entity Retrieval☆24Apr 11, 2025Updated last year
- ☆22Sep 16, 2021Updated 4 years ago