coosto / dutch-word-embeddings
Dutch word embeddings, trained on a large collection of Dutch social media messages and news/blog/forum posts.
☆44 Updated 2 years ago
Alternatives and similar repositories for dutch-word-embeddings:
Users who are interested in dutch-word-embeddings are comparing it to the libraries listed below.
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s … ☆136 Updated 2 years ago
- A Dutch RoBERTa-based language model ☆198 Updated 10 months ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆82 Updated 3 years ago
- spaCy + UDPipe ☆160 Updated 2 years ago
- 110k Dutch Book Reviews Dataset for Sentiment Analysis ☆30 Updated last year
- UIMA CAS processing library written in Python ☆86 Updated 9 months ago
- The weights for the embedding layer of Scandinavian ULMFiT language models ☆32 Updated 5 years ago
- Language Models for Zalando's flair library ☆61 Updated 5 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models ☆156 Updated 2 years ago
- A Python library for calculating a large variety of metrics from text ☆324 Updated 2 months ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆75 Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 Updated 2 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆76 Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts ☆79 Updated last year
- Athens NLP Summer School Labs ☆42 Updated 11 months ago
- Fuzzy matching and more functionality for spaCy. ☆254 Updated 7 months ago
- A Python wrapper for the multilingual temporal tagger HeidelTime. ☆26 Updated 2 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules ☆21 Updated last year
- Sentence transformers models for spaCy ☆107 Updated last year
- Google USE (Universal Sentence Encoder) for spaCy ☆182 Updated last year
- Training Temporal Word Embeddings with a Compass ☆64 Updated 2 years ago
- Python bindings to the Dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆47 Updated last month
- spaCy pipeline object for negating concepts in text ☆279 Updated 8 months ago
- spaCy NER annotator using ipywidgets ☆120 Updated 10 months ago
- A module to compute textual lexical richness (aka lexical diversity). ☆99 Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆79 Updated 7 months ago
- Anonymization of legal cases (Fr) based on Flair embeddings ☆87 Updated 4 years ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆138 Updated 2 months ago
- This is a simple Python package for calculating a variety of lexical diversity indices ☆71 Updated last year
- 📂 Additional lookup tables and data resources for spaCy ☆100 Updated 2 weeks ago