dbmdz/berts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dbmdz/berts)

dbmdz / berts

DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models

☆158

Alternatives and similar repositories for berts

Users that are interested in berts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stefan-it / europeana-bert
View on GitHub
BERT and ELECTRA models trained on Europeana Newspapers
☆39Dec 14, 2021Updated 4 years ago
stefan-it / german-gpt2
View on GitHub
German GPT-2 model
☆32Aug 17, 2021Updated 4 years ago
stefan-it / gc4lm
View on GitHub
GC4LM: A Colossal (Biased) language model for German
☆13May 2, 2021Updated 5 years ago
stefan-it / fine-tuned-berts-seq
View on GitHub
Fine-tuned Transformers compatible BERT models for Sequence Tagging
☆40Jul 17, 2020Updated 6 years ago
tsproisl / SoMaJo
View on GitHub
A tokenizer and sentence splitter for German and English web and social media texts.
☆153Dec 9, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
German-NLP-Group / german-transformer-training
View on GitHub
Plan and train German transformer models.
☆23Feb 22, 2021Updated 5 years ago
t-systems-on-site-services-gmbh / german-elmo-model
View on GitHub
This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.
☆28Dec 15, 2019Updated 6 years ago
dbmdz / clef-hipe
View on GitHub
Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions
☆20Mar 27, 2023Updated 3 years ago
qurator-spk / sbb_ner
View on GitHub
Named Entity Recognition
☆19Feb 13, 2026Updated 5 months ago
deepset-ai / FARM
View on GitHub
Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
☆1,752Dec 20, 2023Updated 2 years ago
stefan-it / ukrainian-electra
View on GitHub
Ukrainian ELECTRA model
☆12Mar 11, 2023Updated 3 years ago
aghie / parsing-as-pretraining
View on GitHub
Parsing only with Pretraining Networks
☆16Jul 25, 2024Updated 2 years ago
UniversalDependencies / UD_German-GSD
View on GitHub
☆20May 6, 2026Updated 2 months ago
dbmdz / historic-ner
View on GitHub
Repository for "Towards Robust Named Entity Recognition for Historic German"
☆18Dec 11, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
pdufter / staticlama
View on GitHub
☆13Apr 16, 2021Updated 5 years ago
t-systems-on-site-services-gmbh / german-wikipedia-text-corpus
View on GitHub
This is a german text corpus from Wikipedia. It is cleaned, preprocessed and sentence splitted. It's purpose is to train NLP embeddings l…
☆23Feb 22, 2022Updated 4 years ago
pdufter / minimult
View on GitHub
Analyzing mBERT's multilinguality in a small laboratory setting
☆13Jun 12, 2023Updated 3 years ago
uds-lsv / GermEval-2018-Data
View on GitHub
This repository contains all manually labeled data from the GermEval-2018 shared task.
☆29Sep 28, 2018Updated 7 years ago
ziqizhang / sti
View on GitHub
Implementation of algorithms for semantic table implementation, including the TableMiner+ method
☆19Sep 1, 2022Updated 3 years ago
tblock / 10kGNAD
View on GitHub
Ten Thousand German News Articles Dataset for Topic Classification
☆88Nov 7, 2022Updated 3 years ago
elenanereiss / Legal-Entity-Recognition
View on GitHub
A Dataset of German Legal Documents for Named Entity Recognition
☆179Oct 19, 2022Updated 3 years ago
DFKI-NLP / RelEx
View on GitHub
RelEx - A simple framework for Relation Extraction built on AllenNLP
☆15Jun 17, 2020Updated 6 years ago
tonianelope / Multilingual-BERT
View on GitHub
Investigating multilingual language models (BERT) by using them for NER in German and English
☆14Apr 30, 2019Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
openredact / expose-text
View on GitHub
This is a prototype of a Python module for simple modification of document files. ➡️ The project has moved to: https://gitlab.opencode.de…
☆19Mar 20, 2026Updated 4 months ago
SaraS92 / CAE_ADD
View on GitHub
☆13Nov 21, 2023Updated 2 years ago
adbar / German-NLP
View on GitHub
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
☆527Oct 30, 2024Updated last year
stefan-it / italian-bertelectra
View on GitHub
🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)
☆18Oct 20, 2022Updated 3 years ago
oscar-project / goclassy
View on GitHub
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
☆86Apr 21, 2021Updated 5 years ago
spyysalo / wiki-bert-pipeline
View on GitHub
Generate BERT vocabularies and pretraining examples from Wikipedias
☆17May 11, 2020Updated 6 years ago
Pleias / OCRoscope
View on GitHub
Small python package to measure OCR quality and other related metrics.
☆26Feb 19, 2024Updated 2 years ago
adrianeboyd / boyd-wnut2018
View on GitHub
Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)
☆17Jul 16, 2024Updated 2 years ago
idb-ita / GilBERTo
View on GitHub
GilBERTo: A pretrained language model based on RoBERTa for Italian
☆73Jan 2, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
UB-Mannheim / reichsanzeiger-nlp
View on GitHub
Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…
☆16Oct 18, 2024Updated last year
LEL-A / GerAlpacaDataCleaned
View on GitHub
German Alpaca Dataset (Cleaned + Translated)
☆26Apr 6, 2023Updated 3 years ago
StarlangSoftware / NGram-Py
View on GitHub
Ngrams with Basic Smoothings
☆19May 31, 2026Updated last month
idiap / Node_weighted_GCN_for_depression_detection
View on GitHub
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews
☆21Jul 1, 2025Updated last year
applicaai / pyramidions
View on GitHub
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14May 15, 2022Updated 4 years ago
gooofy / transformer-lm
View on GitHub
Transformer language model (GPT-2) with sentencepiece tokenizer
☆10Oct 15, 2019Updated 6 years ago
pdufter / densray
View on GitHub
Getting interpretable dimensions in word embedding spaces.
☆15Jul 6, 2023Updated 3 years ago