This repo contains files downloaded from Transkribus with corresponding suggested OCR improvements (performed using ChatGPT AI).
☆19Mar 3, 2026Updated 2 months ago
Alternatives and similar repositories for LLM-powered-OCR-correction
Users that are interested in LLM-powered-OCR-correction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…☆20Jun 5, 2025Updated 11 months ago
- This project contains the code to use custom fasttext embeddings with flair framework.☆11May 2, 2025Updated last year
- Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).☆13Jun 22, 2022Updated 3 years ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- A Zotero plugin that automatically retrieves and updates paper metadata from multiple academic sources based on paper titles.☆28Mar 17, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Arabic OCR OCR system for Arabic language that converts images (multi-fonts) of typed text to machine-encoded text. The system currently …☆10Oct 12, 2021Updated 4 years ago
- The Weather Map is a visual model inspired by the synoptic weather charts for the map of controversies.☆11Feb 7, 2024Updated 2 years ago
- Explore Building Computer Use Agents with Gemini 2.0☆19Dec 12, 2024Updated last year
- A collection of notebooks for Natural Language Processing☆25Jan 13, 2025Updated last year
- ☆13Oct 11, 2023Updated 2 years ago
- simple NMT With Attention For Arabic to English☆11Mar 5, 2022Updated 4 years ago
- An open-source, browser-based front-end application for the collection of complex structured data from textual resources in history and t…☆16May 13, 2026Updated last week
- Lightweight Traefik middleware plugin that enable users to authenticate on specific domains using GitHub OAuth☆16Mar 30, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Moved to https://codeberg.org/araichev/kml2geojson.☆16May 5, 2026Updated 2 weeks ago
- Testing out HTR-OCR-Text translation using Google's Tesseract engine in real-time.☆20Oct 6, 2020Updated 5 years ago
- HTRflow is the underlying engine for our HTR-pipeline☆75Apr 9, 2026Updated last month
- A VUE IIIF viewer☆15Dec 14, 2025Updated 5 months ago
- Online Handwritten Text Recognition (HTR) system implemented with PyTorch. Based on https://doi.org/10.1007/s10032-020-00350-4.☆23May 13, 2026Updated last week
- spaCy-compatible sm/md/lg/trf core models for Latin, i.e pipeline with POS tagger, morphologizer, lemmatizer, dependency parser, and NER☆12Aug 26, 2025Updated 8 months ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆13Aug 21, 2025Updated 8 months ago
- Geolocation visualization project for Foodies (a gurgaon based food chain)☆23Jul 29, 2020Updated 5 years ago
- Create PDFs from IIIF manifests, completely client-side (with server-based fallback for unsupported browsers)☆49Oct 4, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Image Binarization for improving OCR and HTR☆23Aug 18, 2022Updated 3 years ago
- Source code for WACV20 paper "Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition".☆16Jun 12, 2020Updated 5 years ago
- Folium plugin to provide fast webgl rendering for GeoJSON FeatureCollections☆15Jun 28, 2024Updated last year
- Python script to read temperature and humidity from Switchbot Meter and send via MQTT (for Home Assistant etc)☆20Jan 9, 2020Updated 6 years ago
- Ground Truth Resources for the HTR of patrimonial documents☆49May 10, 2026Updated last week
- Cerno is a local-first research platform that leverages agentic AI to break down complex queries into verifiable, multi-step workflows. S…☆69Sep 11, 2025Updated 8 months ago
- VIKUS IIIF Generator☆17Oct 28, 2025Updated 6 months ago
- A Django-based web application that simplifies exam lifecycle management from creation to grading, integrating OCR and AI for an automate…☆11Jul 7, 2024Updated last year
- Code repository for the course "Forecasting with Machine Learning Models"☆29Mar 12, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fuzzy search modules for searching lists of words in low quality OCR and HTR text.☆23Mar 30, 2026Updated last month
- Erlangen CRM - An OWL implementation of the CIDOC Conceptual Reference Model☆45Sep 20, 2024Updated last year
- Offline Handwritten Text Recognition (HTR) system☆19Aug 8, 2019Updated 6 years ago
- Pytorch implementation of HTR on IAM dataset (word or line level + CTC loss)☆21Jul 28, 2022Updated 3 years ago
- Turn a folder of images into a working IIIF setup – in a minute or less!☆62Mar 24, 2026Updated last month
- Taller de pgRouting para la Reunión de Usuarios QGIS México 2019☆23Aug 16, 2022Updated 3 years ago
- Supercharge 2DGS with SIBR monitor☆23Oct 1, 2025Updated 7 months ago