CITlabRostock / citlab-article-separation-new
Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Horizon 2020 project NewsEye. For more information about the project see https://www.newseye.eu/.
☆22 Updated 3 years ago
Alternatives and similar repositories for citlab-article-separation-new
Users who are interested in citlab-article-separation-new are comparing it to the repositories listed below.
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 Updated last week
- An extensible viewer for OCR-D mets.xml files ☆22 Updated last year
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2) ☆14 Updated 7 months ago
- ☆14 Updated 3 years ago
- Transcriptions of primers (19th century) ☆11 Updated last month
- Tools for normalizing the use of some characters and checking file consistencies ☆11 Updated 11 months ago
- Some bits of javascript to transcribe scanned pages using PageXML ☆17 Updated last year
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆16 Updated last year
- Named entity annotation tool ☆28 Updated 2 years ago
- You Actually Look Twice At it ☆37 Updated 11 months ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 Updated 4 years ago
- ☆63 Updated last week
- Small collection of PAGE XML related scripts used at the ZPD Würzburg ☆12 Updated last year
- Check your modified Ground Truth files with visual support! ☆10 Updated last year
- Layout analysis to find layout elements in documents (similar to P2PaLA) ☆20 Updated this week
- A repository for online OCRD training infrastructure. ☆13 Updated 5 years ago
- Conversions between various OCR formats ☆82 Updated 2 years ago
- OCRopus model for Gothic print (Fraktur) ☆19 Updated 5 years ago
- Master repository which includes most other OCR-D repositories as submodules ☆72 Updated 5 months ago
- Python tools for performing various operations on ALTO XML files ☆48 Updated 9 months ago
- ☆32 Updated 3 months ago
- Training files for Greek cursive script (in early print) ☆15 Updated 4 years ago
- Fork of dhSegment for experiments on visual and textual feature combination. ☆15 Updated 4 years ago
- An OCR evaluation tool ☆68 Updated 4 months ago
- OCR-D wrapper for detectron2 based segmentation models ☆17 Updated 7 months ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets ☆58 Updated 2 months ago
- Obsolete repo, merged into eynollah ☆12 Updated 2 months ago
- Core libraries by the PRImA Research Lab ☆16 Updated last year
- Converters for various file formats used for representing OCR ☆12 Updated 7 months ago
- Highlighting various OCR formats directly in Solr ☆86 Updated last week