mateuszwosinski / ocr-with-bert

Improving the quality of OCR with typo recognition and correction using a pretrained BERT model.

☆10

Alternatives and similar repositories for ocr-with-bert

Users who are interested in ocr-with-bert are comparing it to the repositories listed below.

DS3Lab / TableParser
Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22
☆14 Updated last year
omarsou / layoutlm_CORD
Evaluation of the Layoutlm model on the CORD dataset
☆32 Updated 3 years ago
Noba1anc3 / Document-Analysis-Recognition
☆17 Updated 4 years ago
OCR-D / ocrd_anybaseocr
DFKI Layout Detection for OCR-D
☆47 Updated 2 months ago
yikeqicn / DeepErase
A U-Net-based deep learning model to remove line, box, and spurious artifacts from text images. Unsupervised training.
☆59 Updated 5 years ago
NVlabs / ocropus3-ocrobin
☆25 Updated 7 years ago
octanove / grammartagger
GrammarTagger: A Neural Multilingual Grammar Profiler for Language Learning
☆28 Updated 4 years ago
kavishgambhir / xy-cut-tree
Segmenting a given document using the recursive XY-cut algorithm.
☆12 Updated 6 years ago
tmbdev-tutorials / icdar2019-worksheets
☆25 Updated 5 years ago
whq-hqw / sroie2019
This is an OCR solution for receipts, invoices, etc.
☆20 Updated 5 years ago
tulasiram58827 / Information-Extraction-From-Documents
This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper.
☆25 Updated 4 years ago
ternaus / base64ToImageConverters
Library for converting RGB / grayscale images to base64 and back.
☆19 Updated 2 years ago
applicaai / kleister-nda
☆58 Updated 3 years ago
swapnil-ahlawat / Document_Layout_Analysis-MonkAI
DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…
☆26 Updated 4 years ago
huridocs / pdf-reading-order
☆13 Updated last year
qurator-spk / sbb_textline_detection
Detect text lines in document images
☆93 Updated last year
sam-ai / BertGrid
Implementation of BertGrid: https://arxiv.org/abs/1909.04948
☆30 Updated last year
kayoyin / DirtyDocuments
☆22 Updated 5 years ago
jarobyte91 / post_ocr_correction
Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"
☆37 Updated last year
SparshaSaha / Handwritten-Number-Recognition-With-Image-Segmentation
Handwritten Number Recognition using CNN and Character Segmentation
☆18 Updated 7 years ago
gmarus777 / Printed-Latex-Data-Generation
Python and JS tools to generate printed LaTeX formulas and images
☆16 Updated last year
ocr-d-modul-2-segmentierung / ocrd-pixelclassifier-segmentation
Wrapper around a pixel classifier
☆9 Updated 3 years ago
nikolamilosevic86 / TabInOut
Framework for information extraction from tables
☆41 Updated 6 years ago
AlibabaPAI / one_shot_text_labeling
Code and data for the paper "One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction"
☆61 Updated 4 years ago
cneud / ocr-gt
OCR & Ground Truth Resources
☆76 Updated 3 years ago
PrithivirajDamodaran / vision-language-modelling-series
Companion repo for the Vision Language Modelling YouTube series (https://bit.ly/3PsbsC2) by Prithivi Da. Open to PRs and collaborations.
☆14 Updated 2 years ago
KMKnation / Four-Point-Invoice-Transform-with-OpenCV
I have customized Adrian's code to find the 4 points of a document or rectangle dynamically. Here I have added findLargestCountours and co…
☆38 Updated 7 years ago
AmanSavaria1402 / TableNet
TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images. In modern times, more a…
☆59 Updated 3 years ago
karndeb / Arxiv-Neural-Search
Neural search system for arXiv AI/ML papers
☆54 Updated 3 years ago
applicaai / kleister-charity
☆40 Updated 3 years ago