sorenlind / lemmy
🤘 Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪
☆76 · Updated 3 years ago
Alternatives and similar repositories for lemmy:
Users that are interested in lemmy are comparing it to the libraries listed below
- DaCy: The State of the Art Danish NLP pipeline using SpaCy ☆95 · Updated 3 months ago
- DaNLP is a repository for Natural Language Processing resources for the Danish Language. ☆205 · Updated 2 months ago
- Dataframe Integration with spaCy. ☆103 · Updated 4 years ago
- Pre-trained Nordic models for BERT ☆169 · Updated 3 years ago
- spaCy + UDPipe ☆161 · Updated 3 years ago
- A curated list of awesome resources for Danish language technology ☆176 · Updated 4 months ago
- A collection of Danish Transformers ☆30 · Updated 3 years ago
- Simple customizable pipeline tool for anonymizing Danish text. ☆10 · Updated 7 months ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English) ☆15 · Updated 3 years ago
- The weights for the embedding layer of Scandinavian ULMFiT language models ☆32 · Updated 5 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks ☆158 · Updated 2 years ago
- Emoji handling and meta data for spaCy with custom extension attributes ☆181 · Updated last year
- Additional lookup tables and data resources for spaCy ☆106 · Updated 2 months ago
- Named Entity Recognition for Danish ☆17 · Updated 5 years ago
- The Danish Gigaword project ☆16 · Updated 4 years ago
- Danish Semantic analysis ☆18 · Updated 4 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 · Updated last year
- Fuzzy matching and more functionality for spaCy. ☆256 · Updated 9 months ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more … ☆112 · Updated 11 months ago
- Ælæctra was created as part of a Cognitive Science bachelor thesis, in the attempt to enhance the Danish NLP community with a more effici… ☆28 · Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings ☆88 · Updated 4 years ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4. ☆65 · Updated 3 years ago
- Hunspell extension for spaCy 2.0. ☆94 · Updated 8 months ago
- Linguistic and stylistic complexity measures for (literary) texts ☆80 · Updated last year
- Information extraction from English and German texts based on predicate logic ☆135 · Updated last year
- Text tokenization and sentence segmentation (segtok v2) ☆201 · Updated 3 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆83 · Updated 3 years ago
- A Dutch RoBERTa-based language model ☆201 · Updated last year
- A Scandinavian Benchmark for sentence embeddings ☆36 · Updated 2 months ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,… ☆75 · Updated 4 months ago