QuickSign/ocrized-text-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QuickSign/ocrized-text-dataset)

QuickSign / ocrized-text-dataset

Quicksign OCRized Text Dataset (QS-OCR)

☆45

Alternatives and similar repositories for ocrized-text-dataset

Users that are interested in ocrized-text-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bharathrajcl / Multimodal-deep-networks-for-text-and-image-based-document-classification
View on GitHub
It is an implementation of research paper with title 'Multimodal deep networks for text and image-based document classification'
☆13Jul 31, 2021Updated 4 years ago
bertsky / ocrd_publaynet
View on GitHub
convert PubLayNet data into METS/PAGE-XML
☆10Mar 17, 2020Updated 6 years ago
dhlab-epfl / dhSegment-torch
View on GitHub
dhSegment on pytorch
☆35Jun 12, 2023Updated 3 years ago
Shreeshrii / ocr-evaluation-tools
View on GitHub
☆16Mar 24, 2021Updated 5 years ago
javiferran / document-classification
View on GitHub
☆15Jun 22, 2020Updated 6 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
elias-ramzi / HAPPIER
View on GitHub
This repo contains the official implementation of HAPPIER: Hierarchical Average Precision Training for Pertinent Image Retrieval (ECCV'22…
☆24Apr 6, 2023Updated 3 years ago
tmbdev-talks / icdar2019-worksheets
View on GitHub
☆25Apr 18, 2020Updated 6 years ago
herobd / FUDGE
View on GitHub
Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"
☆33Mar 4, 2022Updated 4 years ago
sam-ai / BertGrid
View on GitHub
Implementation of BertGrid : https://arxiv.org/abs/1909.04948
☆30Apr 10, 2024Updated 2 years ago
ronghanghu / vqa-maskrcnn-benchmark-m4c
View on GitHub
Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…
☆13Jan 30, 2020Updated 6 years ago
altoxml / schema
View on GitHub
ALTO XML schema - latest and all former versions
☆55Jul 8, 2026Updated 2 weeks ago
AILab-UniFI / cte-dataset
View on GitHub
CTE: Contextualized Table Extraction Dataset
☆17Feb 23, 2023Updated 3 years ago
omni-us / research-seq2seq-HTR
View on GitHub
☆21Jul 24, 2019Updated 7 years ago
mrzjy / sunburst
View on GitHub
A simple Python implementation of ngram sunburst (nested pie chart) visualization showed in CoQA paper
☆14Mar 12, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
outcomesinsights / conceptql
View on GitHub
A high-level language that allows researchers to unambiguously define their research algorithms.
☆18Jul 17, 2026Updated last week
sciencefictionlab / chargrid-pytorch
View on GitHub
Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)
☆27Mar 11, 2022Updated 4 years ago
nttmdlab-nlp / VisualMRC
View on GitHub
VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)
☆57Mar 31, 2025Updated last year
samyakbhuta / chhapkaam
View on GitHub
ગુજરાતી ફોન્ટ અવલોકન
☆19Jan 11, 2014Updated 12 years ago
mittagessen / kraken-models
View on GitHub
Recognition Models for Kraken and CLSTM
☆17Aug 21, 2019Updated 6 years ago
biswassanket / synth_doc_generation
View on GitHub
Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021
☆93Jul 16, 2021Updated 5 years ago
herobd / NAF_dataset
View on GitHub
Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.
☆38May 12, 2022Updated 4 years ago
seanbenhur / hindi_image_captioning
View on GitHub
A Hindi Image Captioning system made completely with Transformers🤗
☆10Apr 16, 2024Updated 2 years ago
antoinedelplace / Chargrid
View on GitHub
Extraction of meaningful instances from document images with a Chargrid model
☆34Aug 9, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FactoDeepLearning / LinePytorchOCR
View on GitHub
☆17Feb 16, 2023Updated 3 years ago
Shreeshrii / tess5train-fonts
View on GitHub
Files and Scripts to run Tesseract 5 LSTM Training using fonts
☆78Feb 6, 2022Updated 4 years ago
benedikt-budig / glyph-miner
View on GitHub
Glyph Miner, a system for extracting glyphs from early typeset prints
☆34Sep 29, 2016Updated 9 years ago
Michael-Xiu / ICDAR-SROIE
View on GitHub
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
☆29Apr 25, 2019Updated 7 years ago
zlwang-cs / LASER-release
View on GitHub
Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Framework
☆14May 31, 2023Updated 3 years ago
athento / hocr-parser
View on GitHub
HOCR Specification Python Parser
☆12Sep 23, 2015Updated 10 years ago
DCGM / pero-ocr
View on GitHub
☆72Jul 16, 2026Updated last week
ljos / navnkjenner
View on GitHub
Named-Entity Recognition for Norwegian Bokmål and Nynorsk
☆12Aug 5, 2019Updated 6 years ago
ejmichaud / precision-ml
View on GitHub
☆13Feb 12, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
phamquiluan / table-transformer
View on GitHub
CVPR 2022: Table Structure Recognition
☆40Apr 19, 2022Updated 4 years ago
Early-Modern-OCR / hOCR-De-Noising
View on GitHub
code to remove "noise" from hOCR output of Tesseract OCR.
☆14Oct 24, 2016Updated 9 years ago
tukeyclothespin / scimitar
View on GitHub
Arabic Text Detection in Images
☆15Apr 5, 2018Updated 8 years ago
outerbounds / nbdoc
View on GitHub
Generate beautiful, testable documentation with Jupyter Notebooks
☆21Jul 25, 2022Updated 3 years ago
navdeep-G / interpretable-ml
View on GitHub
Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.
☆21Feb 2, 2026Updated 5 months ago
melvinwevers / CV_tutorial
View on GitHub
Computer Vision tutorial for DH Summer School Antwerp
☆11Jul 10, 2026Updated 2 weeks ago
ITUnlp / UniParse
View on GitHub
UniParse: A universal graph-based parsing toolkit
☆11Oct 2, 2019Updated 6 years ago