ruathudo/post-ocr-correction

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ruathudo/post-ocr-correction)

ruathudo / post-ocr-correction

☆11

Alternatives and similar repositories for post-ocr-correction

Users that are interested in post-ocr-correction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mikahama / natas
View on GitHub
Python 3 library for processing historical English
☆68Aug 10, 2024Updated last year
ltgoslo / simple_elmo_training
View on GitHub
Minimal code to train ELMo models in recent versions of TensorFlow
☆14Jun 16, 2026Updated last month
TurkuNLP / ocr-correction
View on GitHub
Post-processing OCR errors with seq2seq models
☆28Jul 30, 2020Updated 5 years ago
mikahama / murre
View on GitHub
The amazing 🐕will normalize non-standard Finnish/Swedish and dialectalize standard Finnish!
☆31Aug 10, 2024Updated last year
antonisa / embeddings
View on GitHub
Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages
☆15Apr 11, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shrutirij / ocr-post-correction
View on GitHub
☆141Mar 5, 2024Updated 2 years ago
mjpost / bin
View on GitHub
bin files
☆13Jan 30, 2025Updated last year
BlackKakapo / Romanian-Word-Embeddings
View on GitHub
Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gen…
☆13Oct 6, 2025Updated 9 months ago
tsproisl / SoMeWeTa
View on GitHub
A part-of-speech tagger with support for domain adaptation and external resources.
☆24Oct 26, 2022Updated 3 years ago
Helsinki-NLP / subalign
View on GitHub
☆16Sep 28, 2023Updated 2 years ago
GarfieldLyu / OCR_POST_DE
View on GitHub
OCR post correction for old German corpus
☆20Aug 29, 2022Updated 3 years ago
sushant1827 / Trigger-Word-Detection
View on GitHub
Coursera - RNN Programming Assignment: In this project, we will construct a speech dataset and implement an algorithm for trigger word de…
☆10Aug 29, 2021Updated 4 years ago
shreyshah97 / Newspaper-Segmentation
View on GitHub
Newspaper Segmentation into images and text
☆12Jan 11, 2019Updated 7 years ago
mikahama / FinMeter
View on GitHub
Tools for assessing Finnish poetry: rhymes, meter, hyphenation of Finnish and so on.
☆13Dec 13, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
kmike / dialog2017
View on GitHub
☆10Jul 21, 2017Updated 9 years ago
lluz / jquery-conveyor-ticker
View on GitHub
Simple horizontal conveyor belt animated ticker.
☆12Nov 23, 2022Updated 3 years ago
RealKinetic / aws-glue-pipeline-example
View on GitHub
An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.
☆13Oct 15, 2020Updated 5 years ago
hsci-r / las
View on GitHub
Linguistic Analysis Command-Line Tool
☆14Sep 23, 2019Updated 6 years ago
mikahama / pdfy
View on GitHub
A Python library for converting HTML files into PDF with Chrome's engine.
☆21Aug 10, 2024Updated last year
alan-turing-institute / room2glo
View on GitHub
☆11Jan 20, 2020Updated 6 years ago
cisnlp / parcoure
View on GitHub
ParCourE - Parallel Corpus Explorer
☆12Dec 27, 2021Updated 4 years ago
jordij / jordijoan.me
View on GitHub
Personal blog site using Wagtail CMS
☆19Dec 27, 2022Updated 3 years ago
SapienzaNLP / clubert
View on GitHub
Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.
☆10Jan 4, 2021Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
timarkh / uniparser-grammar-udm
View on GitHub
Morphological analysis for Udmurt.
☆12May 23, 2026Updated last month
akcarsten / cook_book
View on GitHub
Jupyter notebooks for the articles on Medium about translating a cook book
☆13Nov 18, 2019Updated 6 years ago
UBC-NLP / aoc_id
View on GitHub
Arabic Dialect Identification on AOC data.
☆24Mar 2, 2019Updated 7 years ago
anklowait / python_for_CL
View on GitHub
материалы курса по питону для студентов дпо-программы "компьютерная лингвистика" в НИУ ВШЭ (2020-2021)
☆12Feb 21, 2022Updated 4 years ago
spyysalo / bert-pos
View on GitHub
Part-of-speech tagging using BERT
☆10Nov 14, 2019Updated 6 years ago
maria-antoniak / fight-harassment-in-research
View on GitHub
☆17Aug 19, 2024Updated last year
yeazin / Stackoverflow-Clone
View on GitHub
This project is a clone version of a Famous Developers community website Stackoverflow.
☆10Jul 4, 2021Updated 5 years ago
COST-ELTeC / ELTeC
View on GitHub
Umbrella repository that describes the collections contained in any given release of ELTeC
☆13Jan 26, 2022Updated 4 years ago
hslh / pie-detection
View on GitHub
Automatic Detection of Potentially Idiomatic Expressions
☆12Feb 19, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mikahama / syntaxmaker
View on GitHub
The NLG tool for Finnish
☆24Dec 13, 2023Updated 2 years ago
munnellg / OfMagesAndMagic
View on GitHub
A simple fighting game for teaching Python
☆14Dec 12, 2016Updated 9 years ago
vmkhlv / hse_compling_and_it
View on GitHub
Материалы курса "Компьютерная лингвистика и информационные технологии" для 4-го курса бакалавриата направления "Фундаментальная и приклад…
☆10Mar 25, 2021Updated 5 years ago
amunategui / Read-and-Process-Files-Larger-Than-RAM
View on GitHub
Using the function read.table() to break file into chunks to loop and process them. This allows processing files of any size beyond what …
☆10Aug 19, 2014Updated 11 years ago
clab / cnn-v1
View on GitHub
Legacy version of CNN neural net toolkit (now called dynet)
☆19Oct 8, 2016Updated 9 years ago
jarobyte91 / post_ocr_correction
View on GitHub
Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"
☆39Dec 2, 2023Updated 2 years ago
supasorn / GoogleScholarCopyBibTeX
View on GitHub
Copy BibTeX on Google Scholar Search page with a single click
☆20Nov 5, 2023Updated 2 years ago