jfilter / german-preprocessingLinks
🇩🇪 Preprocess German texts to do some serious natural-language processing.
☆12Updated 3 years ago
Alternatives and similar repositories for german-preprocessing
Users that are interested in german-preprocessing are comparing it to the libraries listed below
Sorting:
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆162Updated last year
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆512Updated last year
- A lemmatizer for German language text☆94Updated 2 years ago
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…☆19Updated 3 weeks ago
- Legal Reference Extraction☆38Updated 8 months ago
- Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).☆103Updated 10 months ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆19Updated last year
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format)☆36Updated 2 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆240Updated last year
- The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden markov models…☆55Updated 9 months ago
- Open German WordNet☆99Updated 2 months ago
- German language support for TextBlob.☆102Updated 11 months ago
- NLP-helper for OCR-ed pages in PAGE XML format☆10Updated last year
- Open Legal Data Platform☆124Updated last week
- This repository contains all the materials for my "Python Programming for Linguists" workshop. This is a Python workshop for beginners wi…☆35Updated 2 years ago
- A webapp for labour-time calculation.☆51Updated this week
- Bot, der Wörter auf Twitter und Mastodon postet, die zum ersten Mal im Bundestag gesagt wurden.☆18Updated 9 months ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆18Updated 2 weeks ago
- German sentiment scores with SentiWS as extension for spaCy☆38Updated 3 years ago
- ☆32Updated last year
- The Jupyter Book is aimed at historians who are looking for a first interactive introduction to the Python programming language in German…☆14Updated 3 years ago
- Helsinki Finite-State Technology (library and application suite)☆136Updated 2 months ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆74Updated last year
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Updated 2 years ago
- ParlaMint: Comparable Parliamentary Corpora☆72Updated last month
- ☆20Updated last month
- python package to read and write CLDF datasets☆21Updated 4 months ago
- Ground Truth Resources for the HTR of patrimonial documents☆45Updated this week
- Danish Semantic analysis☆18Updated 5 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆34Updated 3 years ago