qurator-spk / mods4pandas
Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis
☆11 · Updated 2 months ago
Alternatives and similar repositories for mods4pandas:
Users who are interested in mods4pandas are comparing it to the repositories listed below
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS ☆23 · Updated 2 months ago
- A repository for illustrating the transformation of a PAGE XML file into XML-TEI format, resulting from experimentations made for the LEC… ☆17 · Updated 2 years ago
- Implementation of OLA-HD ("Ein OCR-D-Langzeitarchiv für historische Drucke") ☆9 · Updated 2 years ago
- An extensible viewer for OCR-D mets.xml files ☆20 · Updated 8 months ago
- IIIF Examples and useful code ☆18 · Updated 5 months ago
- Transcriptions of primers (19th century) ☆11 · Updated last year
- Web service for creating and hosting IIIF manifests from METS/MODS documents ☆35 · Updated 2 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2) ☆14 · Updated 4 months ago
- ☆29 · Updated 3 weeks ago
- Specifications for the DTS API ☆28 · Updated 3 months ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 · Updated 3 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆14 · Updated 3 months ago
- Correspondence Metadata Interchange Format ☆20 · Updated last month
- Awesome AI in Libraries ☆16 · Updated last year
- Docker integration of Kitodo.Production and OCR-D ☆9 · Updated 11 months ago
- OCR-D wrapper for prima-pagetopdf ☆8 · Updated 3 months ago
- IIIF Presentation API implementation in Python ☆35 · Updated 9 months ago
- Text Overlay plugin for Mirador 3 ☆50 · Updated last month
- Library to parse and create METS files, especially for Archivematica. ☆21 · Updated last month
- Python tools for performing various operations on ALTO XML files ☆44 · Updated this week
- Tentative way towards a shared API for prosopographical data based on the factoid model (Bradley/Short 2005) ☆24 · Updated 2 years ago
- Some bits of JavaScript to transcribe scanned pages using PageXML ☆17 · Updated 10 months ago
- OCR IIIF images in a manifest and generate annotations ☆24 · Updated this week
- Command-line tools to transform TEI & METS files to IIIF Presentation API manifests ☆19 · Updated 8 years ago
- You Actually Look Twice At it ☆30 · Updated 3 weeks ago
- ☆4 · Updated 3 years ago
- Chrome extension to detect IIIF content in web pages ☆20 · Updated 2 years ago
- TEI Manuscript Description ODD Customisation ☆16 · Updated 9 months ago