thinh-re / s-multimae
[ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection
☆12 Updated 11 months ago
Alternatives and similar repositories for s-multimae
Users that are interested in s-multimae are comparing it to the libraries listed below
- Dictionary-guided Scene Text Recognition (CVPR-2021) ☆152 Updated last year
- Vietnamese handwritten text recognition system ☆17 Updated 4 years ago
- PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR) ☆311 Updated last year
- DocILE: Document Information Localization and Extraction Benchmark ☆138 Updated last year
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR) ☆83 Updated 2 years ago
- Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation ☆203 Updated 5 months ago
- CRAFT (Baek et al., 2019) model training code ☆50 Updated last year
- A collection of OCR-related datasets ☆197 Updated 3 years ago
- Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labelin… ☆265 Updated last year
- Scene text recognition ☆108 Updated 3 years ago
- Handwritten Text Recognition in Few-shot Scenario ☆22 Updated 2 years ago
- The task aims at extracting required fields in receipts captured by mobile devices ☆33 Updated 3 years ago
- swin-transformer custom for OCR ☆116 Updated last year
- a dataset for camera-based table detection ☆16 Updated 4 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library ☆103 Updated 3 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi… ☆70 Updated last year
- Code for BMVC2020 paper "Text and Style Conditioned GAN for Generation of Offline Handwriting Lines" ☆73 Updated 2 years ago
- ☆88 Updated 9 months ago
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features … ☆72 Updated 2 years ago
- A curated list of papers about key information extraction. ☆102 Updated 11 months ago
- ☆19 Updated 3 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation ☆75 Updated last year
- Create TensorRT-runtime for vietocr ☆12 Updated 4 years ago
- CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks ☆186 Updated 2 years ago
- ☆62 Updated last year
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition ☆104 Updated last year
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation ☆149 Updated 6 months ago
- CVPR 2022: Table Structure Recognition ☆40 Updated 3 years ago
- ☆43 Updated last year
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22) ☆102 Updated last year