thinh-re / s-multimae
☆13 · Updated last month
Alternatives and similar repositories for s-multimae:
Users who are interested in s-multimae are comparing it to the repositories listed below:
- Vietnamese handwritten text recognition system · ☆17 · Updated 3 years ago
- Dictionary-guided Scene Text Recognition (CVPR-2021) · ☆143 · Updated 6 months ago
- A dataset for camera-based table detection · ☆16 · Updated 3 years ago
- Scene text recognition · ☆106 · Updated 2 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR) · ☆81 · Updated last year
- Create TensorRT-runtime for vietocr · ☆11 · Updated 3 years ago
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features … · ☆69 · Updated last year
- CVPR 2022: Table Structure Recognition · ☆39 · Updated 2 years ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22) · ☆102 · Updated 7 months ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression · ☆58 · Updated 5 months ago
- A curated list of papers about key information extraction. · ☆89 · Updated last month
- Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labelin… · ☆254 · Updated 7 months ago
- CRAFT (Baek et al., 2019) model training code · ☆45 · Updated 5 months ago
- The task aims at extracting required fields in receipts captured by mobile devices · ☆32 · Updated 2 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi… · ☆66 · Updated 11 months ago
- The official implementation of SPTS v2: Single-Point Text Spotting · ☆130 · Updated last year
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV 2021) · ☆104 · Updated 3 years ago
- Official implementation for Dessurt · ☆57 · Updated 2 years ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer · ☆48 · Updated 7 months ago
- Code for the paper "Pushing the Performance Limit of Scene Text Recognizer without Human Annotation" (CVPR 2022). · ☆28 · Updated 2 years ago
- Custom Swin Transformer for OCR · ☆114 · Updated 11 months ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files · ☆132 · Updated last year
- The official repository for the Vista dataset: a Vietnamese multimodal dataset containing more than 700,000 samples of conversations a… · ☆25 · Updated 8 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation · ☆131 · Updated 2 weeks ago