Python 3 library for processing historical English
☆68 · Aug 10, 2024 · Updated last year
Alternatives and similar repositories for natas
Users that are interested in natas are comparing it to the libraries listed below.
- Normalizes non-standard Finnish/Swedish and dialectalizes standard Finnish! ☆31 · Aug 10, 2024 · Updated last year
- The NLG tool for Finnish ☆24 · Dec 13, 2023 · Updated 2 years ago
- Toolbox for OCR post-correction ☆120 · Sep 19, 2019 · Updated 6 years ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span… ☆97 · Mar 12, 2026 · Updated 2 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Dec 10, 2025 · Updated 6 months ago
- Post-processing OCR errors with seq2seq models ☆28 · Jul 30, 2020 · Updated 5 years ago
- Public API cache proxy built on the Earth Science Online Video Database, an Airtable base, which also syncs to Zotero and broadcasts new … ☆13 · Apr 11, 2026 · Updated last month
- Web application for transcribing OCR ground truth from Archive.org ☆18 · Feb 22, 2018 · Updated 8 years ago
- Correction of spaces with character-based neural language models. ☆13 · Aug 23, 2022 · Updated 3 years ago
- Detect and align similar passages ☆122 · Apr 27, 2026 · Updated last month
- Awesome AI in Libraries ☆17 · Jul 21, 2023 · Updated 2 years ago
- OCRopus model for Gothic print (Fraktur) ☆19 · Feb 16, 2020 · Updated 6 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset ☆17 · Oct 16, 2024 · Updated last year
- Newspaper Segmentation into images and text ☆12 · Jan 11, 2019 · Updated 7 years ago
- IIIF Examples and useful code ☆20 · Sep 10, 2025 · Updated 9 months ago
- Scrape and structure raw data from the Norwegian parliament's API. ☆12 · Oct 24, 2025 · Updated 7 months ago
- convert PubLayNet data into METS/PAGE-XML ☆10 · Mar 17, 2020 · Updated 6 years ago
- Tools for TICCL ☆14 · Dec 12, 2025 · Updated 5 months ago
- Scripts that clean up OCR and munge Hathi metadata. ☆78 · Nov 4, 2017 · Updated 8 years ago
- Presentations, tutorials and data for the OCR workshop at LMU ☆16 · Jun 2, 2017 · Updated 9 years ago
- The GitHub repository for the AI for Humanists Project ☆21 · Jun 9, 2025 · Updated last year
- Back-end part of the whole feedient.com service which shut down April 30th, 2015. ☆11 · Aug 7, 2015 · Updated 10 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆16 · Oct 18, 2024 · Updated last year
- Norwegian Speech Transformer Models ☆19 · Mar 26, 2026 · Updated 2 months ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2) ☆17 · Updated this week
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴󠁧󠁢󠁥󠁮󠁧󠁿) ☆15 · May 3, 2021 · Updated 5 years ago
- Script for workflow to add morphological analysis into ELAN files ☆14 · May 15, 2020 · Updated 6 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras… ☆11 · Dec 13, 2018 · Updated 7 years ago
- This repo works as a sandbox environment for htrflow. ☆40 · Mar 19, 2026 · Updated 2 months ago
- This repository contains the Wikibase configuration of the EU Knowledge Graph ☆14 · May 4, 2026 · Updated last month
- The Digital Humanities Literacy Guidebook ☆69 · Nov 11, 2022 · Updated 3 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python." ☆41 · Nov 29, 2021 · Updated 4 years ago