Data for the HIPE 2022 shared task.
☆21Nov 29, 2023Updated 2 years ago
Alternatives and similar repositories for HIPE-2022-data
Users that are interested in HIPE-2022-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆15Jun 4, 2024Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆20Mar 27, 2023Updated 3 years ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆11Sep 26, 2022Updated 3 years ago
- Libraries, Archives and Museums (LAM)☆88Oct 4, 2022Updated 3 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆22Sep 2, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- OCR post correction for old German corpus☆20Aug 29, 2022Updated 3 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- Named Entity Recognition☆19Feb 13, 2026Updated last month
- ☆14Jul 11, 2022Updated 3 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Metrical position in Greek hexameter.☆13Mar 31, 2026Updated last week
- ☆26Jul 11, 2022Updated 3 years ago
- Turn CTS TEI corpora into CEX collection files☆12Jun 16, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
- A bunch of modules that use/extend CLTK in order to work with Greek and Latin corpora maintained by the Perseus DL☆12Oct 26, 2019Updated 6 years ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- HuCit KB: a knowledge base of classical texts and citable text units.☆11Nov 17, 2021Updated 4 years ago
- ☆14Jul 12, 2022Updated 3 years ago
- Contextualized per-token embeddings☆34May 11, 2025Updated 11 months ago
- Self hosting code for Recogito-Studio☆22Updated this week
- ☆10Feb 20, 2026Updated last month
- ☆20Feb 17, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition.☆64Oct 15, 2022Updated 3 years ago
- Patterns based on the W3C Web Annotation Model, primarily for use in linking resources describing historical phenomena with the places re…☆16Mar 6, 2020Updated 6 years ago
- Archive of the XML files of the Mannheim / Heidelberg CAMENA Neo-Latin project☆20Oct 10, 2018Updated 7 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆39Dec 14, 2021Updated 4 years ago
- Generating graph structures from OWL ontologies☆12Nov 21, 2017Updated 8 years ago
- Detect and align similar passages☆119Mar 17, 2026Updated 3 weeks ago
- This is the official repository for the paper "Laplacian Features for Learning with Hyperbolic Space"☆14Aug 8, 2022Updated 3 years ago
- ☆15Oct 21, 2023Updated 2 years ago
- Code for TACL 2020 paper "An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models"☆14Jul 31, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A Pythonic API and some command line tools to access the Transkribus server via its REST API☆28Nov 25, 2022Updated 3 years ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆21Nov 10, 2024Updated last year
- Code and Data for Evaluation WG☆42May 4, 2022Updated 3 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- An open source web application for creating interactive maps for history and heritage. Created by the Bartlett Centre for Advanced Spatia…☆26Sep 4, 2025Updated 7 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 4 months ago
- A collection of notebooks for Natural Language Processing☆25Jan 13, 2025Updated last year