Python 3 library for processing historical English
☆68Aug 10, 2024Updated last year
Alternatives and similar repositories for natas
Users that are interested in natas are comparing it to the libraries listed below.
- ☆11Nov 14, 2021Updated 4 years ago
- Tools for assessing Finnish poetry: rhymes, meter, hyphenation of Finnish and so on.☆13Dec 13, 2023Updated 2 years ago
- The amazing 🐕will normalize non-standard Finnish/Swedish and dialectalize standard Finnish!☆31Aug 10, 2024Updated last year
- Post-processing OCR errors with seq2seq models☆28Jul 30, 2020Updated 5 years ago
- Public API cache proxy built on the Earth Science Online Video Database, an Airtable base, which also syncs to Zotero and broadcasts new …☆13Apr 11, 2026Updated last month
- Web application for transcribing OCR ground truth from Archive.org☆18Feb 22, 2018Updated 8 years ago
- Correction of spaces with character-based neural language models.☆13Aug 23, 2022Updated 3 years ago
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated last year
- Detect and align similar passages☆121Apr 27, 2026Updated 3 weeks ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Oct 16, 2024Updated last year
- IIIF Examples and useful code☆20Sep 10, 2025Updated 8 months ago
- Scrape and structure raw data from the Norwegian parliament's API.☆12Oct 24, 2025Updated 6 months ago
- ☆10Mar 16, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- Tools for TICCL☆14Dec 12, 2025Updated 5 months ago
- Scripts that clean up OCR and munge Hathi metadata.☆77Nov 4, 2017Updated 8 years ago
- ☆27Feb 2, 2021Updated 5 years ago
- Presentations, tutorials and data for the OCR workshop at LMU☆16Jun 2, 2017Updated 8 years ago
- ☆269Jul 7, 2025Updated 10 months ago
- ☆141Mar 5, 2024Updated 2 years ago
- The GitHub repository for the AI for Humanists Project☆21Jun 9, 2025Updated 11 months ago
- Back-end part of the whole feedient.com service which shut down April 30th, 2015.☆11Aug 7, 2015Updated 10 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Norwegian Speech Transformer Models☆19Mar 26, 2026Updated last month
- ☆15Jul 11, 2022Updated 3 years ago
- Glyph Miner, a system for extracting glyphs from early typeset prints☆34Sep 29, 2016Updated 9 years ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Jan 26, 2022Updated 4 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Dec 13, 2018Updated 7 years ago
- This repo works as a sandbox environment for htrflow.☆40Mar 19, 2026Updated 2 months ago
- This repository contains the Wikibase configuration of the EU Knowledge Graph☆14May 4, 2026Updated 2 weeks ago
- The Digital Humanities Literacy Guidebook☆68Nov 11, 2022Updated 3 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆41Nov 29, 2021Updated 4 years ago
- Editor for aligned parallel texts (personal desktop application).☆20Jan 15, 2026Updated 4 months ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆23Feb 21, 2018Updated 8 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- Named Entity Recognition☆19Feb 13, 2026Updated 3 months ago
- NLP pipeline software using common workflow language☆35Apr 22, 2019Updated 7 years ago
- Distributed AtomSpace Network client☆19Jan 19, 2026Updated 4 months ago