sorenlind / lemmy
๐คLemmy is a lemmatizer for Danish ๐ฉ๐ฐ and Swedish ๐ธ๐ช
โ76Updated 3 years ago
Alternatives and similar repositories for lemmy:
Users that are interested in lemmy are comparing it to the libraries listed below
- DaCy: The State of the Art Danish NLP pipeline using SpaCyโ95Updated 2 months ago
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.โ204Updated last month
- Simple customizable pipeline tool for anonymizing Danish text.โ10Updated 6 months ago
- Pre-trained Nordic models for BERTโ168Updated 3 years ago
- A curated list of awesome resources for Danish language technologyโ172Updated 3 months ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasksโ157Updated 2 years ago
- Dataframe Integration with spaCy.โ103Updated 4 years ago
- Danish Semantic analysisโ18Updated 4 years ago
- ๐ Additional lookup tables and data resources for spaCyโ105Updated last month
- A collection of Danish Transformersโ30Updated 3 years ago
- A lemmatizer for German language textโ88Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddingsโ88Updated 4 years ago
- The weights for the embedding layer of Scandinavian UMLFiT language modelsโ32Updated 5 years ago
- spaCy + UDPipeโ161Updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more โฆโ112Updated 10 months ago
- รlรฆctra was created as part of a Cognitive Science bachelor thesis, in the attempt to enhance the Danish NLP community with a more efficiโฆโ28Updated 2 years ago
- Sentiment Corpus for Swedish ๐ธ๐ช Norwegian ๐ณ๐ด Danish ๐ฉ๐ฐ Finnish ๐ซ๐ฎ (and English ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ)โ15Updated 3 years ago
- Named Entity Recognition for Danishโ17Updated 5 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iโฆโ46Updated 11 months ago
- ๐งช Cutting-edge experimental spaCy components and featuresโ97Updated 11 months ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", preโฆโ83Updated 3 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensoโฆโ237Updated 7 months ago
- Norwegian Transformer Modelโ115Updated 3 months ago
- UIMA CAS processing library written in Pythonโ87Updated this week
- The Danish Gigaword projectโ16Updated 4 years ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.โ66Updated 3 years ago
- Norwegian Review Corpusโ47Updated 6 months ago
- A Danish-speaking language model with entity-aware self-attentionโ9Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) textsโ79Updated last year
- BERT and ELECTRA models trained on Europeana Newspapersโ37Updated 3 years ago