Mushroomcat9998 / PaddleOCR
Custom repo for training Japanese OCR
☆26Updated 3 years ago
Alternatives and similar repositories for PaddleOCR:
Users that are interested in PaddleOCR are comparing it to the libraries listed below
- Key information extraction from invoice document with Graph Convolution Network☆55Updated last year
- A collection for AI Engineer☆39Updated 8 months ago
- ☆34Updated 2 years ago
- The task aims at extracting required fields in receipts captured by mobile devices☆32Updated 2 years ago
- ☆54Updated last year
- Scene text vietnamese☆14Updated 2 years ago
- Object Detection Model for Scanned Documents☆90Updated 3 weeks ago
- This Repository consists of all my experiments performed on LayoutLMv3 model.☆29Updated 2 years ago
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…☆57Updated 3 weeks ago
- ☆27Updated 3 years ago
- An NVIDIA Triton Server workflow for OCR and the LayoutLMv3 Transformer Model☆30Updated 2 years ago
- ☆20Updated 3 years ago
- Create TensorRT-runtime for vietocr☆12Updated 3 years ago
- EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.☆10Updated 3 months ago
- A curated list of papers about key information extraction.☆91Updated 3 months ago
- Face Anti-Spoofing project. Lanit-Tercom summer school 2022☆46Updated 2 years ago
- ☆42Updated 2 years ago
- This is our solution dealing with BKAI challenge☆62Updated 2 years ago
- Vietnamese celebrities facial recognition - Competition of AIvivn.com☆21Updated 2 years ago
- Dictionary-guided Scene Text Recognition (CVPR-2021)☆144Updated 8 months ago
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition☆149Updated last week
- This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes☆59Updated 10 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 10 months ago
- Use LoRA technique to improve training Large Language Model☆12Updated last year
- This repository utilizes the Triton Inference Server Client, which streamlines the complexity of model deployment.☆17Updated 7 months ago
- Machine Learning Project to identify an ID Card on an image☆58Updated 8 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆133Updated 2 months ago
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆23Updated 4 months ago
- ☆59Updated 7 months ago
- ☆35Updated 3 years ago