coosto / dutch-word-embeddings
Dutch word embeddings, trained on a large collection of Dutch social media messages and news/blog/forum posts.
☆44 Updated 2 years ago
Alternatives and similar repositories for dutch-word-embeddings:
Users who are interested in dutch-word-embeddings are comparing it to the repositories listed below:
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s … ☆135 Updated last year
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆82 Updated 3 years ago
- A Dutch RoBERTa-based language model ☆198 Updated 9 months ago
- 110k Dutch Book Reviews Dataset for Sentiment Analysis ☆30 Updated last year
- The weights for the embedding layer of Scandinavian ULMFiT language models ☆33 Updated 5 years ago
- 🤘 Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆77 Updated 3 years ago
- A Dataset of German Legal Documents for Named Entity Recognition ☆163 Updated 2 years ago
- A Python wrapper for the multilingual temporal tagger HeidelTime. ☆26 Updated 2 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models ☆155 Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆78 Updated 6 months ago
- Linguistic and stylistic complexity measures for (literary) texts ☆79 Updated 11 months ago
- UIMA CAS processing library written in Python ☆86 Updated 8 months ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆75 Updated 3 years ago
- E3C is a freely available multilingual corpus (Italian, English, French, Spanish, and Basque) of semantically annotated clinical narrativ…