lizfischer / document-segmentation
Browser-based app for segmenting and OCRing PDF pages based on whitespace rules. It is intended to assist researchers (especially in the humanities) with turning their materials into machine-actionable datasets.
☆10 Updated 8 months ago
Related projects
Alternatives and complementary repositories for document-segmentation
- A framework for Oxygen XML Editor allowing researchers to transcribe historical documents in TEI ☆21 Updated 4 months ago
- Special Topics in AI: Artificial Intelligence as an Archival Science ☆16 Updated 5 months ago
- Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today! ☆15 Updated this week
- Code repository for whatisdigitalhumanities.com ☆30 Updated 2 years ago
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020) ☆20 Updated 4 years ago
- High-performance text aligner for large collections of texts ☆45 Updated 2 weeks ago
- A standalone React/Redux web application for presenting unique printed books and manuscripts in digital facsimile. ☆32 Updated last year
- Python for Humanities ☆13 Updated this week
- Pipeline for the production of digital scholarly editions of archival collections ☆11 Updated 8 months ago
- Digital Humanities Across Borders ☆46 Updated 7 months ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆24 Updated 2 years ago
- adno.app: the ADNO source code. Adno is a web application for viewing, editing and sharing narratives and pathways on static im… ☆25 Updated this week
- ☆41 Updated 2 months ago
- A hands-on activity in linking and enriching geo-data, part of the Linked Pasts conference ☆14 Updated 3 years ago
- Miscellaneous data analysis tools and scripts for the EHRI project ☆12 Updated 9 months ago
- ☆28 Updated 3 years ago
- Cambridge Digital Humanities 'Introduction to Text-Mining with Python' (workshops 1 and 2) ☆20 Updated last year
- Oral History/Qualitative Interview Data Analysis and Publication Tool ☆18 Updated last year
- A digital humanities operating system that runs on a USB disk. ☆30 Updated 7 years ago
- Personal modeling application for Linked Data. ☆26 Updated 5 years ago
- Jupyter book showing how to build an ML powered book genre classifier ☆12 Updated 3 weeks ago
- Srophé Application. A TEI publishing application. ☆17 Updated last week
- Python implementation of the Zeta score for contrastive text analysis ☆14 Updated 3 years ago
- Command line resource for working with digital primary sources ☆27 Updated 6 years ago
- A Mashup Interface for Text Analysis Operations ☆13 Updated 2 weeks ago
- ☆12 Updated 2 years ago
- Best Practices for TEI in Libraries: A guide for mass digitization, automated workflows, and promotion of interoperability with XML using… ☆32 Updated 6 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts ☆27 Updated 3 years ago
- ☆25 Updated 2 weeks ago