sambalshikhar / Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-Deep
RVL-CDIP could be looked at as the equivalent of ImageNet for the document image community. It’s certainly the largest we’ve seen in the literature. There are 400,000 total document images in the dataset. The dataset contains much noise and variance in composition of each document class. Uncompressed, the dataset size is ~100GB, and comprises 16…
☆18Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Document-Image-Classification-with-Intra-Domain-Transfer-Learning-and-Stacked-Generalization-of-Deep
- ☆60Updated 3 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 3 years ago
- This repository shows how to train a custom detection model with the TFOD API, optimize it with TFLite, and perform inference with the op…☆31Updated 3 years ago
- ☆28Updated 2 years ago
- code and data for paper "One-shot Text Field Labeling using Attention and BeliefPropagation for Structure Information Extraction"☆61Updated 4 years ago
- ☆12Updated 4 years ago
- BFSI sectors deal with lots of unstructured scanned documents which are archived in document management systems for further use.For examp…☆40Updated 3 years ago
- ☆15Updated 3 years ago
- A TensorFlow implementation of hybird CNN-LSTM model with CTC loss for OCR problem☆33Updated 5 years ago
- ☆15Updated 4 years ago
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper.☆25Updated 3 years ago
- Document processing using transformers☆20Updated last year
- Detect handwritten words (neural network based).☆66Updated 2 years ago
- To try CTC in Keras☆17Updated 5 years ago
- Working codes for project☆23Updated last year
- Optical character recognition Using Deep Learning☆29Updated 6 years ago
- Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks☆42Updated 5 years ago
- A simple tf.keras 2.1 implementation of CRNN OCR with data generator☆36Updated 4 years ago
- Notes and code on computer vision course ,PyImageSearch Gurus.☆29Updated 4 years ago
- NLP | NER | SpaCy☆26Updated 3 years ago
- Research papers and code on information extraction from image/pdf☆96Updated last year
- This PyTorch implementation of LayoutLM paper by Microsoft demonstrate the SequenceClassfication task using HuggingFaceTransformers to cl…☆31Updated 2 years ago
- Code and procdures for handwriting object detection and recognition☆79Updated 3 years ago
- End to end tutorial on using Detectron2 for object detection☆12Updated last year
- Machine Learning Project to identify an ID Card on an image☆49Updated 2 years ago
- ☆9Updated 2 years ago
- ☆22Updated 3 years ago
- A Unet based deeplearning model to line/box/spurious artifacts from text images. Unsupervised training.☆57Updated 5 years ago
- Template based form extractor OCR. Train your own character and alphabet OCR.☆16Updated 6 years ago