dkalpakchi / awesome-swedish-nlpLinks
A curated list of resources for natural language processing (NLP) in Swedish
β26Updated 2 years ago
Alternatives and similar repositories for awesome-swedish-nlp
Users that are interested in awesome-swedish-nlp are comparing it to the libraries listed below
Sorting:
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.β207Updated 9 months ago
- π€Lemmy is a lemmatizer for Danish π©π° and Swedish πΈπͺβ79Updated 4 years ago
- A Python library for calculating a large variety of metrics from textβ353Updated 11 months ago
- Linguistic and stylistic complexity measures for (literary) textsβ84Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.β147Updated 11 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEvalβ13β198Updated 2 months ago
- This is a simple Python package for calculating a variety of lexical diversity indicesβ81Updated 2 years ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCyβ98Updated 10 months ago
- Ten Thousand German News Articles Dataset for Topic Classificationβ86Updated 3 years ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postingsβ64Updated 9 months ago
- Google USE (Universal Sentence Encoder) for spaCyβ184Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiencyβ179Updated 5 months ago
- A module to compute textual lexical richness (aka lexical diversity).β110Updated 2 years ago
- Concept Modeling: Topic Modeling on Images and Textβ215Updated last year
- π§ͺ Cutting-edge experimental spaCy components and featuresβ103Updated last year
- Clustering sentence embeddings to extract message intentβ174Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.β156Updated last year
- A python module for English lemmatization and inflection.β274Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)β207Updated 3 years ago
- β171Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a documβ¦β265Updated last year
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more β¦β115Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doβ¦β81Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iβ¦β46Updated last year
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k seβ¦β155Updated 4 months ago
- A Dutch RoBERTa-based language modelβ206Updated last year
- A minimal, pure Python library to interface with CoNLL-U format files.β152Updated last week
- A modern, interlingual wordnet interface for Pythonβ272Updated this week
- BERT model trained from scratch on Finnishβ95Updated 4 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on Germanβ507Updated last year