stefan-it / europeana-bertView external linksLinks
BERT and ELECTRA models trained on Europeana Newspapers
☆38Dec 14, 2021Updated 4 years ago
Alternatives and similar repositories for europeana-bert
Users that are interested in europeana-bert are comparing it to the libraries listed below
Sorting:
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- Transformer language model (GPT-2) with sentencepiece tokenizer☆10Oct 15, 2019Updated 6 years ago
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
- Plan and train German transformer models.☆23Feb 22, 2021Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 4 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆157Dec 6, 2022Updated 3 years ago
- API wrapper for the Springer Nature API☆24Jan 9, 2026Updated last month
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆21Aug 1, 2024Updated last year
- An extensible viewer for OCR-D mets.xml files☆22May 30, 2024Updated last year
- tesseractXplore a tesseract ease of use gui with full control☆27Nov 10, 2021Updated 4 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 5 years ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14May 15, 2022Updated 3 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- ☆10Dec 22, 2022Updated 3 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated last year
- Fork of dhSegment for experiments on visual and textual feature combination.☆15Jan 30, 2021Updated 5 years ago
- Named Entity Recognition☆18Apr 9, 2025Updated 10 months ago
- Detect and align similar passages☆117Sep 25, 2025Updated 4 months ago
- OCR-D wrapper for detectron2 based segmentation models☆17May 1, 2025Updated 9 months ago
- 🤗An unofficial PyTorch implementation of ConvBert based on huggingface/transformers.☆17Oct 6, 2022Updated 3 years ago
- BERT models pretrained on the CORD-19 Kaggle dataset☆15Jun 8, 2020Updated 5 years ago
- Inneall aistriúcháin atá taobh thiar de Chaighdeánaitheoir na Gaeilge, agus aistritheoirí Gàidhlig/Gaelg→Gaeilge☆20Sep 14, 2024Updated last year
- Recognize text using Calamari OCR and the OCR-D framework☆15May 13, 2025Updated 9 months ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- German Alpaca Dataset (Cleaned + Translated)☆26Apr 6, 2023Updated 2 years ago
- Data for the HIPE 2022 shared task.☆21Nov 29, 2023Updated 2 years ago
- Pyro models and misc examples.☆19May 10, 2021Updated 4 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆151Dec 9, 2024Updated last year
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Jul 18, 2019Updated 6 years ago
- CERberus -- guardian against character errors☆29Feb 15, 2024Updated 2 years ago
- Parser für die Plenarprotokolle des Bundestags☆21Jul 17, 2017Updated 8 years ago
- This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.☆28Dec 15, 2019Updated 6 years ago
- ☆62Jan 4, 2023Updated 3 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Mar 21, 2022Updated 3 years ago
- Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text)…☆36Apr 7, 2025Updated 10 months ago
- A Pythonic API and some command line tools to access the Transkribus server via its REST API☆28Nov 25, 2022Updated 3 years ago
- This repo work as a sandbox enviroment for htrflow.☆39Feb 10, 2026Updated last week