altoxml / schema
ALTO XML schema - latest and all former versions
☆52 Updated 9 months ago
Alternatives and similar repositories for schema:
Users who are interested in schema compare it to the repositories listed below.
- Conversions between various OCR formats ☆75 Updated last year
- Python tools for performing various operations on ALTO XML files ☆46 Updated last month
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets ☆55 Updated 9 months ago
- Documentation and use cases for ALTO XML ☆41 Updated 6 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆25 Updated 2 years ago
- The CIS OCR PostCorrectionTool ☆42 Updated 2 years ago
- Named entity annotation tool ☆27 Updated last year
- A plugin that provides support for working with Digital Facsimiles in Text Encoding Initiative (TEI) vocabulary. The plugin contribute… ☆25 Updated 3 years ago
- Exercises for the XQuery Workshops at XQuery at DH2017 ☆50 Updated 6 years ago
- EFES (EpiDoc Front End Services) is a custom and readily customizable platform for publication and search/indexing of EpiDoc files, based… ☆31 Updated 2 months ago
- OCRopus model for Gothic print (Fraktur) ☆18 Updated 5 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR. ☆35 Updated last year
- A Pythonic API and some command line tools to access the Transkribus server via its REST API ☆27 Updated 2 years ago
- Named Entity Recognition tool for Europeana Newspapers ☆14 Updated 7 years ago
- ☆14 Updated 2 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 Updated 3 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2) ☆14 Updated last month
- The base class from which to create a CWRC-Writer XML editor. ☆14 Updated 2 years ago
- An implementation of the TEI Simple ODD extensions for processing models in XQuery. ☆22 Updated 5 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler ☆15 Updated 3 years ago
- DTA Base Format (DTABf) ☆18 Updated last month
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis ☆11 Updated 4 months ago
- You Actually Look Twice At it ☆33 Updated 2 months ago
- Correspondence Metadata Interchange Format ☆20 Updated last month
- Named Entity Recognition API used by TEI Publisher ☆18 Updated 10 months ago
- Tentative way towards a shared API for prosopographical data based on the factoid model (Bradley/Short 2005) ☆24 Updated 2 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 Updated 9 months ago
- QA catalogue – a metadata quality assessment tool for library catalogue records (MARC, PICA) ☆83 Updated this week
- Digitale Geisteswissenschaften rund um Graphentechnologien ☆8 Updated last month
- Specifications for the DTS API ☆28 Updated 5 months ago