Detect and align similar passages
☆117Sep 25, 2025Updated 5 months ago
Alternatives and similar repositories for passim
Users that are interested in passim are comparing it to the libraries listed below
Sorting:
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- An approximate nearest-neighbor search for text reuse.☆12Oct 5, 2020Updated 5 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆23Feb 21, 2018Updated 8 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- A software to detect text reuse with BLAST.☆13Oct 8, 2019Updated 6 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆15Jan 20, 2026Updated last month
- Create PDFs from IIIF manifests, completely client-side (with server-based fallback for unsupported browsers)☆46Oct 4, 2025Updated 4 months ago
- Srophé Application. A TEI publishing application.☆17Nov 4, 2024Updated last year
- Python implementation of the Zeta score for contrastive text analysis☆14Jun 16, 2021Updated 4 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- the EEBO TCP texts☆36Feb 21, 2018Updated 8 years ago
- Python 3 library for processing historical English☆68Aug 10, 2024Updated last year
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)☆25Feb 27, 2020Updated 6 years ago
- TEI in HTML5 Custom Elements☆174Updated this week
- A framework for Oxygen XML Editor allowing researchers to transcribe historical documents in TEI☆21Jun 24, 2024Updated last year
- Cookiecutter template for a Static-Site Digital Scholarly Edition☆15Dec 22, 2025Updated 2 months ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Dec 14, 2021Updated 4 years ago
- CERberus -- guardian against character errors☆29Feb 15, 2024Updated 2 years ago
- Toolbox for OCR post-correction☆122Sep 19, 2019Updated 6 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated last year
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow.☆15Mar 1, 2022Updated 4 years ago
- VIKUS IIIF Generator☆17Oct 28, 2025Updated 4 months ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Mar 21, 2022Updated 3 years ago
- ☆29Feb 19, 2026Updated last week
- A language-independent post-correction app for POS-tagging and lemmatization☆30May 28, 2025Updated 9 months ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Oct 16, 2024Updated last year
- An OCR evaluation tool☆69Aug 22, 2025Updated 6 months ago
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29May 13, 2020Updated 5 years ago
- Library in C++ and a python wrapper for dealing with Page XML files☆13Apr 25, 2025Updated 10 months ago
- Special Topics in AI: Artificial Intelligence as an Archival Science☆20May 13, 2024Updated last year
- Create fast & light digital projects with natural markdown and IIIF materials.☆58Updated this week
- Django web application to display, annotate, and export digitized books.☆33Feb 17, 2026Updated last week
- You Actually Look Twice At it☆38Jan 21, 2025Updated last year
- Wrapper around Zekun's model to detect and generate annotations around map labels☆27Aug 15, 2023Updated 2 years ago
- Simple app for visual editing of Page XML files☆31Sep 25, 2025Updated 5 months ago