web64 / norwegian-nlp-resources
Norwegian NLP Resources
☆181Updated 3 years ago
Alternatives and similar repositories for norwegian-nlp-resources:
Users that are interested in norwegian-nlp-resources are comparing it to the libraries listed below
- Norwegian Review Corpus☆48Updated 5 months ago
- Norwegian Transformer Model☆115Updated 2 months ago
- Large-scale language models for Norwegian☆39Updated last year
- Pre-trained Nordic models for BERT☆167Updated 3 years ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆69Updated 5 months ago
- Danish Semantic analysis☆18Updated 4 years ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 5 years ago
- The weights for the embedding layer of Scandinavian UMLFiT language models☆32Updated 5 years ago
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every …☆31Updated last year
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Updated 7 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆76Updated 3 years ago
- Morphosyntactic tagger for Norwegian bokmål and nynorsk☆30Updated last year
- Norwegian Speech Transformer Models☆18Updated 3 months ago
- A collection of Danish Transformers☆30Updated 3 years ago
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.☆204Updated last week
- Named Entity Recognition for Danish☆17Updated 5 years ago
- Tools for Norwegian NLP based on the Norwegian Dependency Treebank.☆17Updated 7 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆79Updated last year
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- spaCy + UDPipe☆160Updated 2 years ago
- A lemmatizer for German language text☆87Updated 2 years ago
- A lemmatizer for Norwegian that uses lexical and contextual information from the Norwegian Dependency Treebank (NDT) and lexical informat…☆7Updated 8 years ago
- Ælæctra was created as part of a Cognitive Science bachelor thesis, in the attempt to enhance the Danish NLP community with a more effici…☆28Updated 2 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆201Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Collection of tools for building diachronic/historical word vectors☆423Updated last year
- UIMA CAS processing library written in Python☆86Updated 9 months ago
- Google USE (Universal Sentence Encoder) for spaCy☆182Updated last year