Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions
☆20Mar 27, 2023Updated 2 years ago
Alternatives and similar repositories for clef-hipe
Users that are interested in clef-hipe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data for the HIPE 2022 shared task.☆21Nov 29, 2023Updated 2 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆39Dec 14, 2021Updated 4 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- Jupyter book showing how to build an ML powered book genre classifier☆13Oct 16, 2024Updated last year
- BADLAD: Bengali Document Layout Analysis Dataset☆15May 12, 2024Updated last year
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Use visual programming to build data tables based on text data within the Orange data mining software environment☆30Oct 20, 2025Updated 5 months ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆15Jun 4, 2024Updated last year
- A collection of scripts for teaching and learning basic text mining methods in R☆10Sep 10, 2018Updated 7 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆61Jan 12, 2018Updated 8 years ago
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 4 months ago
- GC4LM: A Colossal (Biased) language model for German☆13May 2, 2021Updated 4 years ago
- MediaWiki extension that adds support for local media files to Wikibase via a new data type.☆12Oct 2, 2025Updated 5 months ago
- generate shape expressions from CSV☆11Mar 2, 2026Updated 3 weeks ago
- ☆13Sep 7, 2021Updated 4 years ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Jul 18, 2021Updated 4 years ago
- Repository containing controlled vocabularies and data published by DOREMUS☆16Mar 29, 2024Updated last year
- argparse.ArgumentParser wrapper to parse TOML files from the command-line☆22Oct 14, 2025Updated 5 months ago
- Named Entity Recognition☆19Feb 13, 2026Updated last month
- Convert Kindle Clippings to Objects that conform with W3C Web Annotation Vocabulary☆14Sep 18, 2017Updated 8 years ago
- Named Entity Disambiguation and Linking☆16May 24, 2024Updated last year
- A Node.js tool to examine the correctness of Open Data Metadata and build custom dataset profiles☆12Sep 26, 2023Updated 2 years ago
- The GitHub repository for the AI for Humanists Project☆20Jun 9, 2025Updated 9 months ago
- Web-based synthesis of nifty NLP and entity extraction services☆13Oct 25, 2019Updated 6 years ago
- Describing music catalogs☆23Jul 9, 2024Updated last year
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
- Text-based media editing interface☆16Aug 9, 2017Updated 8 years ago
- ☆10Mar 14, 2025Updated last year
- A tool for automatic spelling normalization☆21Jan 18, 2021Updated 5 years ago
- Wikibase extension that allows defining RDF mappings for Wikibase Entities☆16Feb 2, 2026Updated last month
- ☆13Feb 23, 2026Updated last month
- ☆15Dec 12, 2024Updated last year
- Serialization component for the Asphalt framework☆11Mar 16, 2026Updated last week
- ☆22Jan 20, 2024Updated 2 years ago
- Example SPARQL queries, mostly for working with ZBW data sets☆16Oct 8, 2025Updated 5 months ago
- Convert Wikidata Items to vector embeddings☆37Feb 25, 2026Updated 3 weeks ago
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- ☆13Sep 28, 2020Updated 5 years ago