IlyasMoutawwakil / Dive-into-OCR-by-Paddle
“Dive Into OCR” is a textbook developed by the PaddleOCR community that integrates OCR theory and practice.
☆9Updated 2 years ago
Alternatives and similar repositories for Dive-into-OCR-by-Paddle:
Users that are interested in Dive-into-OCR-by-Paddle are comparing it to the libraries listed below
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆318Updated last year
- Pytorch implementation of Paper by Google Research - Representation Learning for Information Extraction from Form-like Documents.☆101Updated 2 years ago
- Pytorch Implementation of TableNet☆63Updated 3 years ago
- An end to end Deep Learning Solution for table detection and structure recognition☆11Updated 3 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆24Updated last year
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆117Updated last year
- This Repository consists of all my experiments performed on LayoutLMv3 model.☆29Updated 2 years ago
- Research papers and code on information extraction from image/pdf☆96Updated 2 years ago
- ☆74Updated 2 years ago
- ☆10Updated 3 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆267Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆122Updated 9 months ago
- Streamlit Named Entity Recognition (NER) annotation custom component☆39Updated 2 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆175Updated 2 months ago
- Simplifying cheque processing for banks using Transformers☆17Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- ☆20Updated 3 years ago
- This repo contains code to convert Structured Documents to Graphs and implement a Graph Convolution Neural Network for node classificatio…☆145Updated 2 years ago
- ☆13Updated 4 years ago
- TableNet Implementation on Pytorch☆147Updated 2 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆345Updated 2 years ago
- CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)☆157Updated 2 years ago
- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:☆270Updated 2 years ago
- Financial Domain Question Answering with pre-trained BERT Language Model☆123Updated last year
- Document processing using transformers☆20Updated last year
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆54Updated 2 years ago
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper.☆25Updated 4 years ago
- A curated list of papers about key information extraction.☆90Updated 2 months ago
- Infographic about the inner computations of a transformer model, training and inference☆82Updated 10 months ago
- ☆349Updated last year