ljos / navnkjennerLinks
Named-Entity Recognition for Norwegian Bokmål and Nynorsk
☆12Updated 6 years ago
Alternatives and similar repositories for navnkjenner
Users that are interested in navnkjenner are comparing it to the libraries listed below
Sorting:
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆71Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆170Updated 3 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆96Updated 2 years ago
- spaCy + UDPipe☆166Updated 3 years ago
- spaCy-to-naf converter☆21Updated 7 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆56Updated 2 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆87Updated 3 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- Sentence transformers models for SpaCy☆109Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 11 months ago
- Norwegian Transformer Model☆116Updated last week
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python☆112Updated last month
- Wikidata embedding☆51Updated last year
- A collection of notebooks for Natural Language Processing☆25Updated last year
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆20Updated 2 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆63Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Updated 5 months ago
- A spaCy wrapper for DBpedia Spotlight☆113Updated 2 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆23Updated 3 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆171Updated 4 years ago
- ☆64Updated 2 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆84Updated 4 years ago
- This repository contains the Framester resource, the main outcome of the framester project.☆33Updated 2 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions☆12Updated 8 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆79Updated 4 years ago