PedroBarcha / old-books-datasetLinks
Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binarization). Noised and denoised sets (done by several methods) are eventually going to be uploaded.
☆15Updated 8 years ago
Alternatives and similar repositories for old-books-dataset
Users that are interested in old-books-dataset are comparing it to the libraries listed below
Sorting:
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆27Updated 6 years ago
- ☆18Updated last year
- CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images☆133Updated 3 months ago
- Close-Domain fine-tuning for table detection☆72Updated 3 years ago
- Handwritten text recognition using transformers.☆158Updated last year
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆33Updated 3 years ago
- TableNet Implementation on Pytorch☆150Updated 3 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels☆62Updated last year
- Repo to host the forms dataset☆17Updated 4 years ago
- Pytorch Implementation of TableNet☆67Updated 4 years ago
- CVPR 2022: Table Structure Recognition☆40Updated 3 years ago
- https://betterprogramming.pub/table-detection-and-extraction-tablenet-deep-learning-model-with-pytorch-from-images-64489e92b641☆15Updated 2 years ago
- ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...☆182Updated 4 years ago
- ☆15Updated 5 years ago
- Document Visual Question Answering☆128Updated 5 years ago
- Extraction of meaningful instances from document images with a Chargrid model☆34Updated 4 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.☆38Updated 3 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆70Updated last year
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated 2 years ago
- Detectron2 for Document Layout Analysis☆187Updated last year
- Lightweight CRNN for OCR (including handwritten text) with depthwise separable convolutions and spatial transformer module [keras+tf]☆149Updated 6 years ago
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆129Updated 2 years ago
- A Dense Text Detection model using Receptive Field Blocks☆31Updated 3 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆139Updated last year
- Research papers and code on information extraction from image/pdf☆97Updated 3 years ago
- ☆87Updated 5 years ago
- list all open dataset about ocr.☆100Updated 7 years ago
- ☆126Updated 5 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆62Updated last year