Browser-based app for segmenting and OCRing PDF pages based on whitespace rules. It is intended to help researchers (especially in the humanities) turn their materials into machine-actionable datasets.
☆12 · Mar 2, 2024 · Updated 2 years ago
Alternatives and similar repositories for document-segmentation
Users who are interested in document-segmentation are comparing it to the repositories listed below.
- A framework for Oxygen XML Editor allowing researchers to transcribe historical documents in TEI ☆21 · Jun 24, 2024 · Updated last year
- Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today! ☆17 · Updated this week
- In an effort to decrease the execution time of the OCR process, a multi-processing script was created using Python's multi-processing mod… ☆10 · Dec 6, 2019 · Updated 6 years ago
- Encoding the Bible in TEI, starting with the Gospels ☆26 · Aug 18, 2025 · Updated 7 months ago
- Landing Page ☆11 · Jan 19, 2026 · Updated 2 months ago
- Website for "Data-sitters Club" ☆19 · Sep 5, 2025 · Updated 6 months ago
- Special Topics in AI: Artificial Intelligence as an Archival Science ☆20 · May 13, 2024 · Updated last year
- Cut away words from digital books and render the resulting images ☆21 · Feb 3, 2026 · Updated last month
- The texts used for building Archive for Danish Literature ☆12 · Jun 24, 2024 · Updated last year
- ARK minter, binder, resolver ☆23 · Feb 25, 2026 · Updated 3 weeks ago
- A highly customizable, lightweight mastodon feed embed component ☆63 · Jul 24, 2023 · Updated 2 years ago
- Digital Mappa (DM for short) is a freely available online environment for creating projects out of digital images and texts. ☆23 · Dec 15, 2025 · Updated 3 months ago
- ☆12 · Apr 24, 2017 · Updated 8 years ago
- ☆10 · Apr 16, 2020 · Updated 5 years ago
- For those who want to make their own hack using my Totsugeki code. I ask that you credit me if this is used. ☆32 · May 6, 2022 · Updated 3 years ago
- A free-to-use Adobe InDesign template for scholarly publishing ☆18 · Aug 14, 2023 · Updated 2 years ago
- A QGIS3 plugin to create a water network (sewer network, river network) ☆15 · Nov 19, 2025 · Updated 4 months ago
- TEI Publisher Learning Hub ☆26 · Feb 22, 2026 · Updated last month
- Tools for interactive grid creation and manipulation in the console ☆14 · Jun 19, 2022 · Updated 3 years ago
- Best Practices for TEI in Libraries: A guide for mass digitization, automated workflows, and promotion of interoperability with XML using… ☆34 · Sep 9, 2018 · Updated 7 years ago
- An Obsidian plugin for executing Sage computations in notes. ☆14 · Dec 24, 2022 · Updated 3 years ago
- R script for visualising patient ward movements as timelines ☆13 · May 13, 2022 · Updated 3 years ago
- R package for Extended Date/Time Format (EDTF) ☆16 · Jun 2, 2025 · Updated 9 months ago
- Incorporates external dependencies into HTML file using data: URI scheme ☆21 · Nov 17, 2011 · Updated 14 years ago
- ☆39 · Jun 6, 2024 · Updated last year
- A fork of Mastodon designed for civic communities looking to run their own social networks. ☆37 · Aug 15, 2023 · Updated 2 years ago
- 🎨Community driven colour palettes ☆12 · May 1, 2020 · Updated 5 years ago
- Quarto dashboard examples using OJS cells ☆13 · Jun 27, 2024 · Updated last year
- An attempt to turn copy and pasting into linking ☆13 · Jun 1, 2021 · Updated 4 years ago
- Social Network Analysis and STEM Education is designed to prepare researchers to apply network analysis in order to better understand and… ☆14 · Jul 14, 2025 · Updated 8 months ago
- Tiny vi text editor clone with enough features to be truly useful ☆15 · Feb 14, 2024 · Updated 2 years ago
- ☆10 · Nov 15, 2025 · Updated 4 months ago
- “Open terminals”, “load CSVs”, “start hacking” ☆16 · May 2, 2017 · Updated 8 years ago
- small script for managing google scholar alert emails ☆11 · May 6, 2023 · Updated 2 years ago
- Life & Times of a Reproducible Clinical Project ☆15 · Jan 15, 2019 · Updated 7 years ago
- Simple rules based grapheme to phoneme in Python ☆11 · Sep 2, 2017 · Updated 8 years ago
- Help documentation to build COG files with GDAL ☆11 · Jan 21, 2025 · Updated last year
- ☆11 · Jan 22, 2018 · Updated 8 years ago
- Legacy official MegaZeux git repository. Use http://github.com/AliceLR/megazeux instead. ☆14 · Jul 19, 2018 · Updated 7 years ago