sorenlind / lemmy
๐คLemmy is a lemmatizer for Danish ๐ฉ๐ฐ and Swedish ๐ธ๐ช
โ76Updated 3 years ago
Alternatives and similar repositories for lemmy:
Users that are interested in lemmy are comparing it to the libraries listed below
- DaCy: The State of the Art Danish NLP pipeline using SpaCyโ95Updated last month
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.โ204Updated last week
- Simple customizable pipeline tool for anonymizing Danish text.โ10Updated 5 months ago
- A curated list of awesome resources for Danish language technologyโ172Updated 2 months ago
- Pre-trained Nordic models for BERTโ167Updated 3 years ago
- Dataframe Integration with spaCy.โ103Updated 3 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasksโ157Updated 2 years ago
- The weights for the embedding layer of Scandinavian UMLFiT language modelsโ32Updated 5 years ago
- รlรฆctra was created as part of a Cognitive Science bachelor thesis, in the attempt to enhance the Danish NLP community with a more efficiโฆโ28Updated 2 years ago
- A collection of Danish Transformersโ30Updated 3 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iโฆโ46Updated 10 months ago
- A Danish-speaking language model with entity-aware self-attentionโ9Updated 3 years ago
- ๐ Additional lookup tables and data resources for spaCyโ101Updated 3 weeks ago
- Sentiment Corpus for Swedish ๐ธ๐ช Norwegian ๐ณ๐ด Danish ๐ฉ๐ฐ Finnish ๐ซ๐ฎ (and English ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ)โ15Updated 3 years ago
- Danish Semantic analysisโ18Updated 4 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more โฆโ112Updated 9 months ago
- German Morphological Analyzerโ47Updated 3 years ago
- spaCy + UDPipeโ160Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsโ54Updated 2 years ago
- Python Multilingual Ucrel Semantic Analysis Systemโ31Updated 6 months ago
- Named Entity Recognition for Danishโ17Updated 5 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.โ23Updated last year
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "Whatโs so special about BERTโs โฆโ136Updated 2 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", preโฆโ83Updated 3 years ago
- Norwegian Transformer Modelโ115Updated 2 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.โ138Updated 2 months ago
- A Scandinavian Benchmark for sentence embeddingsโ33Updated last week
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.โ126Updated 3 years ago
- ParlaMint: Comparable Parliamentary Corporaโ55Updated this week
- The Danish Gigaword projectโ16Updated 4 years ago