ja-mcm/OCRfixr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ja-mcm/OCRfixr)

ja-mcm / OCRfixr

A context-based spellchecker for correcting OCR output.

☆21

Alternatives and similar repositories for OCRfixr

Users that are interested in OCRfixr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hirmeos / entity-fishing-client-python
View on GitHub
Repository hosting the common code for the entity-fishing clients
☆10May 18, 2026Updated 2 months ago
quadrismegistus / lltk
View on GitHub
Literary Language Toolkit: code, models, corpora, and web tools
☆11Jul 5, 2026Updated 3 weeks ago
pjox / gutf
View on GitHub
Terminal tool that converts files encoding to UTF-8
☆10Oct 5, 2019Updated 6 years ago
tedunderwood / DataMunging
View on GitHub
Scripts that clean up OCR and munge Hathi metadata.
☆78Nov 4, 2017Updated 8 years ago
nealhaddaway / SRflowdiagram
View on GitHub
☆14Nov 25, 2020Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
DEFI-COLaF / LADaS
View on GitHub
Layout Analysis Dataset with Segmonto (LADaS)
☆25May 29, 2026Updated last month
PonteIneptique / choco-mufin
View on GitHub
Tools for normalizing the use of some characters and checking file consistencies
☆12May 30, 2026Updated last month
synopsx / synopsx
View on GitHub
SynopsX is a lightweight XML publishing framework
☆13Jul 10, 2026Updated 2 weeks ago
kermitt2 / biblio-glutton-extension
View on GitHub
A browser extension providing Open Access bibliographical services
☆18Dec 9, 2022Updated 3 years ago
cisocrgroup / ocrd_cis
View on GitHub
OCR-D python tools
☆33Aug 16, 2024Updated last year
jakelever / knowledgediscovery
View on GitHub
Analysis code for knowledge discovery project
☆12Sep 25, 2018Updated 7 years ago
gregdeon / spotlight
View on GitHub
Implementation of the spotlight: a method for discovering systematic errors in deep learning models
☆11Oct 5, 2021Updated 4 years ago
cambridgeltl / SIPHS
View on GitHub
☆15Sep 20, 2018Updated 7 years ago
texttechnologylab / GerParCor
View on GitHub
German Parliamentary Corpus (GerParCor)
☆32Mar 29, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
greenelab / knowledge-graph-review
View on GitHub
A literature review for constructing and using knowledge graphs in a biomedical setting.
☆11May 22, 2020Updated 6 years ago
HazyResearch / ddbiolib
View on GitHub
DeepDive Biomedical Tools
☆15Apr 3, 2017Updated 9 years ago
wabyking / word2fun
View on GitHub
☆11May 9, 2022Updated 4 years ago
Ejhfast / empath-outofdate
View on GitHub
☆10Jul 17, 2015Updated 11 years ago
thunlp / CSS-LM
View on GitHub
CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models
☆11Jul 1, 2023Updated 3 years ago
OpenNMT / nmt-wizard-docker
View on GitHub
Dockerized NMT frameworks for nmt-wizard
☆39Apr 18, 2023Updated 3 years ago
ZhaofengWu / SIFT
View on GitHub
☆14Aug 3, 2022Updated 3 years ago
arwhirang / DDI-recursive-NN
View on GitHub
☆11Feb 2, 2018Updated 8 years ago
natliblux / nautilusocr
View on GitHub
METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)
☆56May 30, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
shauli-ravfogel / descriptions
View on GitHub
☆10May 11, 2024Updated 2 years ago
dadelani / menyo-20k_MT
View on GitHub
☆11Jul 12, 2021Updated 5 years ago
lezhang7 / TreeMix
View on GitHub
[NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
☆10Jul 15, 2023Updated 3 years ago
OrKatz7 / parler-hate-speech
View on GitHub
☆10Jun 18, 2023Updated 3 years ago
basaldella / bioreddit
View on GitHub
Word embeddings trained on medical subreddits.
☆10Jan 4, 2021Updated 5 years ago
PedroBarcha / context-spelling-correction
View on GitHub
Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…
☆11Dec 13, 2018Updated 7 years ago
mohit3011 / AbuseAnalyzer
View on GitHub
Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"
☆11Jul 18, 2021Updated 5 years ago
Tian312 / PICO_Parser
View on GitHub
A clinical BERT-based NLP tool for parsing clinical trial abstracts following the PICO framework
☆47Oct 15, 2020Updated 5 years ago
WHaverals / CERberus
View on GitHub
CERberus -- guardian against character errors
☆30Jul 3, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
nlpcl-lab / CADD_dataset
View on GitHub
CADD: A Large-scale Comprehensive Abusiveness Detection Dataset with Multifaceted Labels from Reddit
☆12Sep 28, 2022Updated 3 years ago
repozhang / malevolent_dialogue
View on GitHub
MDRDC dataset and used baselines
☆11Feb 20, 2023Updated 3 years ago
JunjieHu / amber
View on GitHub
Explicit Alignment Objectives for Multilingual Bidirectional Encoders
☆14Apr 14, 2021Updated 5 years ago
StonyBrookNLP / PerSenT
View on GitHub
[COLING2020] A challenge dataset for Person SenTiment analysis in news domain.
☆11May 2, 2022Updated 4 years ago
applicaai / pyramidions
View on GitHub
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14May 15, 2022Updated 4 years ago
NastaranBa / ACE-for-Sarcasm-Detection
View on GitHub
☆11Dec 1, 2020Updated 5 years ago
stefan-it / ukrainian-electra
View on GitHub
Ukrainian ELECTRA model
☆12Mar 11, 2023Updated 3 years ago