Bhashini-IITJ / IndicPhotoOCR
Comprehensive Scene Text Recognition Toolkit across 13 Indian Languages
☆15Updated last week
Alternatives and similar repositories for IndicPhotoOCR:
Users that are interested in IndicPhotoOCR are comparing it to the libraries listed below
- Graph-based Layout Analysis Model☆16Updated 4 months ago
- Official implementation for ICDAR 2021 best poster paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Tr…☆124Updated last year
- This repository is created to share current progress of transformer based optical character recognition(OCR). Welcome to share~☆48Updated last year
- OCR Annotations from Amazon Textract for Industry Documents Library☆101Updated 2 years ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆117Updated last year
- OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes☆56Updated 3 weeks ago
- IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing …☆43Updated 5 months ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆267Updated 2 years ago
- A tiny nearest-neighbor embedding database written in C☆19Updated last year
- A collection of OCR-related datasets☆150Updated 2 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆345Updated 2 years ago
- YOLOv10 trained on DocLayNet dataset.☆71Updated 3 months ago
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition☆129Updated last week
- Unofficial implementation of the paper "Full Page Handwriting Recognition via Image to Sequence Extraction" by Singh et al. (2021).☆52Updated 2 years ago
- Line Chart Data Extraction: Official code for LineFormer - ICDAR23 Paper☆27Updated 8 months ago
- Object Detection Model for Scanned Documents☆88Updated last year
- Handwritten text recognition using transformers.☆156Updated 6 months ago
- An open-source tool for visualisation of outputs of deep-learning models for document analysis tasks such as fully automatic, bounding bo…☆21Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆122Updated 9 months ago
- Soverign rollup based on Rollkit, Cairo VM for the application layer and Bitcoin as a DA layer☆13Updated last year
- Translate Python code to Coq code for formal verification. Applied to the reference implementation of Ethereum in Python.☆32Updated 5 months ago
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆102Updated 10 months ago
- Repo to host the forms dataset☆15Updated 4 years ago
- A star path planning algorithm based line segmentation of handwritten document☆19Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆177Updated 2 months ago
- Use Convolutional Recurrent Neural Network to recognize the Handwritten line text image without pre segmentation into words or characters…☆299Updated 2 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆133Updated last month
- PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)☆301Updated 10 months ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆277Updated 2 months ago