solariz / german_stopwords
Extended list of German stopwords for use in web projects, search engines, or anything else.
☆104 Updated 5 years ago
Alternatives and similar repositories for german_stopwords
Users who are interested in german_stopwords compare it to the libraries listed below.
- German stopwords collection ☆86 Updated 2 years ago
- Stemmer for German ☆45 Updated 3 years ago
- Compound splitter for German ☆107 Updated 5 years ago
- A lemmatizer for German language text ☆91 Updated 2 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 Updated 3 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso… ☆239 Updated 10 months ago
- GermaNet API for Python ☆53 Updated 7 years ago
- German Morphological Analyzer ☆47 Updated 3 years ago
- German language support for TextBlob. ☆104 Updated 5 months ago
- Coreference resolution for German ☆16 Updated 8 years ago
- Bot that posts to Twitter and Mastodon words that were said in the Bundestag for the first time. ☆17 Updated 3 months ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆42 Updated last year
- Ten Thousand German News Articles Dataset for Topic Classification ☆84 Updated 2 years ago
- Parser for the plenary proceedings of the Bundestag ☆22 Updated 7 years ago
- Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag). ☆99 Updated 4 months ago
- This repository contains all manually labeled data from the GermEval-2018 shared task. ☆30 Updated 6 years ago
- Parser for the plenary proceedings of the Bundestag ☆14 Updated 5 years ago
- Natural language processing on German texts ☆16 Updated 7 years ago
- German Parliamentary Corpus (GerParCor) ☆24 Updated 3 months ago
- Scraper for German democracy documents ☆37 Updated last year
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆83 Updated 4 years ago
- COVID-19 statistics for Germany. For states and counties. With time series data. Daily updates. Official RKI numbers. ☆148 Updated last year
- A part-of-speech tagger with support for domain adaptation and external resources. ☆23 Updated 2 years ago
- Quickly extract multi-word phrases from a corpus ☆191 Updated 5 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016… ☆68 Updated 3 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆487 Updated 7 months ago
- German lemmatization with IWNLP as extension for spaCy ☆24 Updated last year
- German part-of-speech dictionary ☆45 Updated last year
- ☆55 Updated 9 years ago
- ☆97 Updated 3 years ago