Living-with-machines / alto2txt
Convert ALTO XML to plain text + minimal metadata
☆13Updated last month
Related projects ⓘ
Alternatives and complementary repositories for alto2txt
- Awesome AI in Libraries☆16Updated last year
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆11Updated 3 months ago
- A module for Omeka S that provides an API for the Neatline 3 single page application☆13Updated last year
- extract text from ALTO file☆9Updated last year
- IIIF Audio/Video Player☆14Updated last year
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆24Updated 2 years ago
- QA-tool for scans with corresponding ALTO-files☆22Updated last year
- Create knowledge graphs with Markdown☆32Updated 7 months ago
- A Python database interface for eXist-db☆14Updated 2 weeks ago
- Exercises for the XQuery Workshops at XQuery at DH2017☆47Updated 6 years ago
- World Historical Gazetteer platform☆18Updated 2 months ago
- A codebase to support a pure JSON search engine requiring no backend for any XHTML5 document collection☆51Updated this week
- Cidoc cRm In Triples mERmaid dIagrAms☆23Updated 3 months ago
- Web application to build XML stand-off markup☆15Updated 3 years ago
- Named-Entity Recognition extension for OpenRefine☆24Updated last year
- The main TEI Publisher app☆68Updated last month
- Conversions between various OCR formats☆71Updated last year
- A Python Markdown extension that lets authors embed RDFa Lite in markdown documents rendered to HTML.☆38Updated 5 years ago
- Given the URL to a public JSON document in an International Image Interoperability Framework (IIIF) image server, this script will downlo…☆15Updated 2 years ago
- Patterns based on the W3C Web Annotation Model, primarily for use in linking resources describing historical phenomena with the places re…☆11Updated 4 years ago
- Python tools for performing various operations on ALTO XML files☆39Updated last year
- A static site generator for TEI Publisher☆12Updated 2 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- ☆24Updated 3 years ago
- Repository for GitDOX, a GitHub Data-storage Online XML editor☆15Updated 7 months ago
- Python for Humanities☆13Updated this week
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- Image processing tools, with a focus on digital preservation☆28Updated 7 months ago
- Vocabseditor is a web-based tool for collaborative work on controlled vocabularies development☆23Updated 5 months ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated last month