thinh-re / s-multimaeLinks
[ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection
☆12Updated 7 months ago
Alternatives and similar repositories for s-multimae
Users that are interested in s-multimae are comparing it to the libraries listed below
Sorting:
- Vietnamese handwritten text recognition system☆17Updated 4 years ago
- Dictionary-guided Scene Text Recognition (CVPR-2021)☆150Updated 11 months ago
- ☆89Updated 5 months ago
- Code for BMVC2020 paper "Text and Style Conditioned GAN for Generation of Offline Handwriting Lines"☆71Updated 2 years ago
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆70Updated 2 years ago
- A collection of OCR-related datasets☆177Updated 2 years ago
- Scene text recognition☆107Updated 3 years ago
- ☆62Updated last year
- a dataset for camera-based table detection☆16Updated 3 years ago
- PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)☆305Updated last year
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Updated last year
- Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labelin…☆260Updated last year
- A curated list of papers about key information extraction.☆97Updated 6 months ago
- DocILE: Document Information Localization and Extraction Benchmark☆130Updated last year
- ☆79Updated last year
- Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation☆203Updated last month
- Arbitrary Shape Text Detection via Boundary Transformer;The paper at: https://arxiv.org/abs/2205.05320, which has been accepted by IEEE T…☆191Updated 2 weeks ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆102Updated 2 years ago
- Create TensorRT-runtime for vietocr☆13Updated 4 years ago
- CVPR 2022: Table Structure Recognition☆40Updated 3 years ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Updated last year
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆102Updated last year
- The task aims at extracting required fields in receipts captured by mobile devices☆32Updated 2 years ago
- A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w…☆86Updated 8 months ago
- ☆17Updated 2 years ago
- https://dl.acm.org/doi/10.1145/3657281☆96Updated last year
- Key information extraction from invoice document with Graph Convolution Network☆56Updated 2 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆68Updated last year
- ☆4Updated last month
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆141Updated 2 months ago