OCR post correction for old German corpus
☆20Aug 29, 2022Updated 3 years ago
Alternatives and similar repositories for OCR_POST_DE
Users that are interested in OCR_POST_DE are comparing it to the libraries listed below.
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 6 months ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆23Apr 23, 2024Updated 2 years ago
- Neo4j powered web application for multimedia collections: bring graph-based exploration and crowd-based indexation.☆24May 1, 2020Updated 6 years ago
- ☆15Jul 11, 2022Updated 3 years ago
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- ☆15Jan 9, 2019Updated 7 years ago
- ☆11Nov 14, 2021Updated 4 years ago
- Named Entity Recognition☆19Feb 13, 2026Updated 4 months ago
- ☆12Jun 13, 2025Updated last year
- OCR-D wrapper for detectron2 based segmentation models☆16May 1, 2025Updated last year
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- ☆24Dec 8, 2022Updated 3 years ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Sep 3, 2024Updated last year
- Fork of dhSegment for experiments on visual and textual feature combination.☆15Jan 30, 2021Updated 5 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- Double-checked Gold Standard Data for Training and Testing OCR Engines☆21Dec 31, 2022Updated 3 years ago
- Data for the HIPE 2022 shared task.☆23May 15, 2026Updated last month
- React component for rendering RDF graphs and datasets using n3.js and cytoscape.js☆10Nov 8, 2021Updated 4 years ago
- tesseractXplore: an easy-to-use Tesseract GUI with full control☆28Nov 10, 2021Updated 4 years ago
- Library in C++ and a python wrapper for dealing with Page XML files☆13Apr 25, 2025Updated last year
- (ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper☆89May 25, 2023Updated 3 years ago
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated 2 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 6 months ago
- A generic webservice to extract RDF statements from XML resources☆19May 21, 2024Updated 2 years ago
- Wrapper for the kraken OCR engine☆12Jul 12, 2025Updated 11 months ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26May 10, 2021Updated 5 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆56May 30, 2023Updated 3 years ago
- Neural graph-based dependency parser☆13Dec 20, 2017Updated 8 years ago
- pyndl implements Naive Discriminative Learning, a learning and classification model based on the Rescorla-Wagner equations in …☆13Dec 8, 2025Updated 6 months ago
- ☆141Mar 5, 2024Updated 2 years ago
- A fully fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆25Oct 27, 2023Updated 2 years ago
- An integration of Sigma.js with Neo4j and some custom render☆19May 31, 2022Updated 4 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆39Dec 14, 2021Updated 4 years ago
- PyTorch implementation of L2R2 in SIGIR 2020☆17Jun 12, 2023Updated 3 years ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 5 years ago
- pydistinto - a Python implementation of different measures of distinctiveness for contrastive text analysis☆11May 15, 2025Updated last year
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…☆18Mar 15, 2024Updated 2 years ago