stopwords-iso / stopwords-de
German stopwords collection
☆85Updated 2 years ago
Alternatives and similar repositories for stopwords-de
Users that are interested in stopwords-de are comparing it to the libraries listed below
Sorting:
- Extended list of German stopwords for use in Web Projects, Search Engines or every thing else.☆103Updated 5 years ago
- A lemmatizer for German language text☆89Updated 2 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆23Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- Custom French POS and lemmatizer based on Lefff for spacy☆66Updated 2 years ago
- Compound splitter for German☆105Updated 5 years ago
- Plan and train German transformer models.☆23Updated 4 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated 2 years ago
- ☆18Updated last week
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 4 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆142Updated 5 months ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings☆16Updated 5 years ago
- NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser☆49Updated last year
- small Java library for splitting German compound words☆63Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆238Updated 8 months ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆481Updated 6 months ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- German sentiment scores with SentiWS as extension for spaCy☆37Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated 3 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden markov models…☆51Updated last month
- German part-of-speech dictionary☆45Updated last year
- The Italian NLP Tool☆71Updated 2 years ago