ohenrik / nb_dep_ud_sm
spaCy model trained on a Norwegian corpus converted from OBT to Universal Dependencies.
☆13 · Updated 7 years ago
Alternatives and similar repositories for nb_dep_ud_sm
Users who are interested in nb_dep_ud_sm are comparing it to the libraries listed below.
- Language detection extension for spaCy 2.0+ ☆112 · Updated 6 years ago
- An unsupervised compound splitter ☆41 · Updated 5 years ago
- 💫 Scripts, tools and resources for developing spaCy ☆126 · Updated 6 years ago
- Python port of Mikolov's word2phrase.c from the word2vec toolkit ☆111 · Updated 5 years ago
- The weights for the embedding layer of Scandinavian ULMFiT language models ☆32 · Updated 5 years ago
- Hunspell extension for spaCy 2.0. ☆94 · Updated 9 months ago
- Python library for Natural Language Preprocessing (NLPre) ☆191 · Updated last year
- Library for unit extraction - fork of quantulum for python3 ☆138 · Updated 10 months ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes ☆181 · Updated 2 years ago
- Named Entity Recognition based on dictionaries ☆242 · Updated 6 years ago
- 💫 REST microservices for various spaCy-related tasks ☆240 · Updated 2 years ago
- For extracting measurements and related entities from text ☆58 · Updated 5 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆83 · Updated 3 years ago
- Minimal Named-Entity Recognizer (MER) ☆57 · Updated 7 months ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities ☆115 · Updated 3 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆36 · Updated 7 years ago
- An introduction to using spaCy for NLP and machine learning ☆191 · Updated 3 years ago
- 🤘 Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆77 · Updated 3 years ago
- Making sense embedding out of word embeddings using graph-based word sense induction ☆213 · Updated 3 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering) ☆115 · Updated last year
- Fast supervised sentence boundary detection using the averaged perceptron ☆90 · Updated 6 years ago
- Named Entity Recognition data for Europeana Newspapers ☆171 · Updated 2 years ago
- NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser ☆49 · Updated last year
- Socially-Equitable Language Identification ☆78 · Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings ☆88 · Updated 4 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆169 · Updated 3 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects. ☆56 · Updated 6 years ago
- Python bindings to the Dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆49 · Updated last month
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank) ☆69 · Updated 8 months ago
- ☆22 · Updated 7 years ago