CITlabRostock / citlab-article-separation-newLinks
Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Horizon 2020 project NewsEye. For more information about the project see https://www.newseye.eu/.
☆21Updated 2 years ago
Alternatives and similar repositories for citlab-article-separation-new
Users that are interested in citlab-article-separation-new are comparing it to the libraries listed below
Sorting:
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Updated last year
- Some bits of javascript to transcribe scanned pages using PageXML☆17Updated last year
- Tools for normalizing the use of some characters and checking file consistencies☆11Updated 7 months ago
- An extensible viewer for OCR-D mets.xml files☆21Updated last year
- ☆13Updated 3 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆14Updated 3 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated last year
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Updated last year
- Layout analysis to find layout elements in documents (similar to P2PaLA)☆19Updated last week
- ☆60Updated last month
- Named entity annotation tool☆28Updated 2 years ago
- A repository for online OCRD training infrastructure.☆13Updated 5 years ago
- You Actually Look Twice At it☆35Updated 7 months ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Updated 3 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Updated 10 months ago
- Check your modified Ground Truth files with visual support!☆10Updated last year
- Python tools for performing various operations on ALTO XML files☆48Updated 6 months ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆12Updated last week
- An OCR evaluation tool☆66Updated last week
- OCR-D wrapper for detectron2 based segmentation models☆17Updated 3 months ago
- Conversions between various OCR formats☆79Updated 2 years ago
- ☆31Updated this week
- Simple app for visual editing of Page XML files☆31Updated 3 months ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Fork of dhSegment for experiments on visual and textual feature combination.☆15Updated 4 years ago
- Ground Truth Resources for the HTR of patrimonial documents☆44Updated this week
- Master repository which includes most other OCR-D repositories as submodules☆73Updated last month
- OCR-D python tools☆33Updated last year
- Training files for Greek cursive script (in early print)☆15Updated 4 years ago
- Named Entity Recognition☆18Updated 4 months ago