CITlabRostock / citlab-article-separation-new
Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Horizon 2020 project NewsEye. For more information about the project see https://www.newseye.eu/.
☆20 Updated 2 years ago
Alternatives and similar repositories for citlab-article-separation-new:
Users who are interested in citlab-article-separation-new are comparing it to the libraries listed below:
- You Actually Look Twice At it ☆30 Updated last month
- An extensible viewer for OCR-D mets.xml files ☆20 Updated 8 months ago
- Some bits of javascript to transcribe scanned pages using PageXML ☆17 Updated 11 months ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 Updated 3 years ago
- Transcriptions of primers (19th century) ☆11 Updated last year
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆14 Updated 4 months ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis ☆11 Updated 2 months ago
- Named entity annotation tool ☆27 Updated last year
- Layout analysis to find layout elements in documents (similar to P2PaLA) ☆18 Updated 2 weeks ago
- Docker integration of Kitodo.Production and OCR-D ☆9 Updated 11 months ago
- Check your modified Ground Truth files with visual support! ☆10 Updated last year
- Python tools for performing various operations on ALTO XML files ☆45 Updated last week
- OCRopus model for Gothic print (Fraktur) ☆18 Updated 5 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2) ☆14 Updated 4 months ago
- Ground Truth Resources for the HTR of patrimonial documents ☆40 Updated this week
- A Pythonic API and some command line tools to access the Transkribus server via its REST API ☆27 Updated 2 years ago
- Conversions between various OCR formats ☆74 Updated last year
- A repository for illustrating the transformation of a PAGE XML file into XML-TEI format, resulting from experimentations made for the LEC… ☆17 Updated 2 years ago
- Named Entity Recognition tool for Europeana Newspapers ☆14 Updated 6 years ago
- Named Entity Recognition ☆17 Updated 3 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrecht 2019 ☆25 Updated 5 years ago
- OCR-D wrapper for prima-pagetopdf ☆8 Updated 3 months ago