wietsedv / bertje

BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s layers? A closer look at the NLP pipeline in monolingual and multilingual models"

☆138

Alternatives and similar repositories for bertje

Users who are interested in bertje are comparing it to the libraries listed below.

iPieter / RobBERT
A Dutch RoBERTa-based language model
☆205 Updated last year
jenojp / negspacy
spaCy pipeline object for negating concepts in text
☆281 Updated last month
ebanalyse / NERDA
Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks
☆159 Updated 2 years ago
dbmdz / berts
DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models
☆156 Updated 2 years ago
ELS-RD / anonymisation
Anonymization of legal cases (Fr) based on Flair embeddings
☆88 Updated 4 years ago
tblock / 10kGNAD
Ten Thousand German News Articles Dataset for Topic Classification
☆84 Updated 2 years ago
paulrinckens / timexy
A spaCy custom component that extracts and normalizes temporal expressions
☆54 Updated 2 years ago
fnl / syntok
Text tokenization and sentence segmentation (segtok v2)
☆205 Updated 3 years ago
certainlyio / nordic_bert
Pre-trained Nordic models for BERT
☆174 Updated 3 years ago
tsproisl / SoMaJo
A tokenizer and sentence splitter for German and English web and social media texts.
☆146 Updated 7 months ago
MartinoMensio / spacy-universal-sentence-encoder
Google USE (Universal Sentence Encoder) for spaCy
☆184 Updated 2 years ago
alexandrainst / danlp
DaNLP is a repository for Natural Language Processing resources for the Danish Language.
☆206 Updated 5 months ago
sorenlind / lemmy
🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪
☆77 Updated 3 years ago
centre-for-humanities-computing / DaCy
DaCy: The State of the Art Danish NLP pipeline using SpaCy
☆96 Updated 6 months ago
flairNLP / flair-lms
Language Models for Zalando's flair library
☆61 Updated 5 years ago
TakeLab / spacy-udpipe
spaCy + UDPipe
☆161 Updated 3 years ago
clips / dutchembeddings
Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…
☆83 Updated 4 years ago
kabirkhan / recon
Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …
☆106 Updated last year
dkpro / dkpro-cassis
UIMA CAS processing library written in Python
☆90 Updated 3 weeks ago
davidberenstein1957 / concise-concepts
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…
☆245 Updated 2 years ago
argilla-io / spacy-wordnet
spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
☆260 Updated 10 months ago
ieriii / spacy-annotator
Spacy NER annotator using ipywidgets
☆124 Updated last year
elenanereiss / Legal-Entity-Recognition
A Dataset of German Legal Documents for Named Entity Recognition
☆171 Updated 2 years ago
moussaKam / BARThez
A French sequence-to-sequence pretrained model
☆61 Updated 2 years ago
KennethEnevoldsen / spacy-wrap
spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…
☆46 Updated last year
gandersen101 / spaczz
Fuzzy matching and more functionality for spaCy.
☆256 Updated last year
BramVanroy / spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…
☆80 Updated last year
kevinlu1248 / pyate
PYthon Automated Term Extraction
☆314 Updated 2 years ago
KennethEnevoldsen / augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.
☆156 Updated last year
MantisAI / nervaluate
Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13
☆182 Updated last week