mateuszwosinski / ocr-with-bert
Improving the quality of OCR with typo recognition and correction using a pretrained BERT model.
☆10 Updated 3 years ago
Alternatives and similar repositories for ocr-with-bert
Users who are interested in ocr-with-bert are comparing it to the repositories listed below
- ☆17 Updated 4 years ago
- Handwritten Number Recognition using CNN and Character Segmentation ☆18 Updated 7 years ago
- DFKI Layout Detection for OCR-D ☆47 Updated 2 weeks ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR ☆15 Updated 3 years ago
- Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22 ☆13 Updated last year
- handwritten text recognition on IAM handwriting dataset ☆15 Updated 5 years ago
- Key Information Extraction From Documents: Evaluation And Generator ☆20 Updated 4 years ago
- Segmenting a given document using recursive xy-cut algorithm. ☆12 Updated 6 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆36 Updated last year
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden… ☆26 Updated 4 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras… ☆11 Updated 6 years ago
- ☆25 Updated 7 years ago
- A dataset of region-annotated scientific articles. ☆21 Updated 5 years ago
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948 ☆30 Updated last year
- ☆23 Updated 5 years ago
- A U-Net-based deep learning model to remove line/box/spurious artifacts from text images. Unsupervised training. ☆59 Updated 5 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context) ☆21 Updated 3 years ago
- Image to Latex using Encoder-Decoder architecture ☆13 Updated last year
- ☆13 Updated last year
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared wi… ☆45 Updated 10 months ago
- English Handwriting Recognition with CRNN and CTC Loss ☆22 Updated 6 years ago
- Implementation of the DocLLM paper for Llama models. ☆13 Updated last month
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper. ☆25 Updated 4 years ago
- Detect textlines in document images ☆93 Updated 11 months ago
- ☆22 Updated 4 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi… ☆68 Updated last year
- ☆38 Updated 4 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations ☆14 Updated 2 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image. ☆16 Updated last month
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents ☆46 Updated 3 years ago