sorenlind / lemmyLinks
π€Lemmy is a lemmatizer for Danish π©π° and Swedish πΈπͺ
β79Updated 4 years ago
Alternatives and similar repositories for lemmy
Users that are interested in lemmy are comparing it to the libraries listed below
Sorting:
- DaCy: The State of the Art Danish NLP pipeline using SpaCyβ99Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.β150Updated last year
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.β207Updated 11 months ago
- A curated list of awesome resources for Danish language technologyβ186Updated last year
- spaCy + UDPipeβ166Updated 3 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more β¦β115Updated last year
- Pre-trained Nordic models for BERTβ175Updated 4 years ago
- Dataframe Integration with spaCy.β103Updated 4 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interfaceβ261Updated 5 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iβ¦β46Updated last year
- Linguistic and stylistic complexity measures for (literary) textsβ84Updated last year
- π Additional lookup tables and data resources for spaCyβ113Updated 7 months ago
- spaCy pipeline object for negating concepts in textβ282Updated 7 months ago
- A lemmatizer for German language textβ94Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)β208Updated 3 years ago
- Fuzzy matching and more functionality for spaCy.β259Updated last year
- Python Multilingual Ucrel Semantic Analysis Systemβ35Updated this week
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "Whatβs so special about BERTβs β¦β141Updated 2 years ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)β71Updated last year
- BERT and ELECTRA models trained on Europeana Newspapersβ38Updated 4 years ago
- German Morphological Analyzerβ51Updated 4 years ago
- Google USE (Universal Sentence Encoder) for spaCyβ184Updated 2 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U filesβ391Updated last month
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", preβ¦β84Updated 4 years ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.β69Updated 4 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.β23Updated 3 years ago
- A simple toolkit for conducting analyses using corpus methodsβ27Updated 4 years ago
- Norwegian Transformer Modelβ116Updated last week
- A spaCy custom component that extracts and normalizes temporal expressionsβ56Updated 2 years ago
- A Python library for calculating a large variety of metrics from textβ358Updated last year