thinh-re / s-multimaeLinks
[ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection
☆12Updated 9 months ago
Alternatives and similar repositories for s-multimae
Users that are interested in s-multimae are comparing it to the libraries listed below
Sorting:
- Vietnamese handwritten text recognition system☆17Updated 4 years ago
- Dictionary-guided Scene Text Recognition (CVPR-2021)☆152Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆74Updated last year
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆72Updated 2 years ago
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆28Updated 10 months ago
- CVPR 2022: Table Structure Recognition☆40Updated 3 years ago
- Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation☆203Updated 3 months ago
- a dataset for camera-based table detection☆16Updated 4 years ago
- The task aims at extracting required fields in receipts captured by mobile devices☆33Updated 2 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆102Updated 3 years ago
- A curated list of papers about key information extraction.☆100Updated 9 months ago
- DocILE: Document Information Localization and Extraction Benchmark☆135Updated last year
- Create TensorRT-runtime for vietocr☆12Updated 4 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Updated 2 years ago
- ☆88Updated 7 months ago
- swin-transformer custom for OCR☆115Updated last year
- A collection of OCR-related datasets☆188Updated 3 years ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆58Updated last year
- ☆62Updated last year
- PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)☆307Updated last year
- Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labelin…☆263Updated last year
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Updated 2 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆68Updated last year
- Scene text recognition☆107Updated 3 years ago
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆105Updated last year
- ☆20Updated 3 years ago
- ☆18Updated 2 years ago
- Hadwritten Text Recognition in Few-shot Scenario☆22Updated 2 years ago
- CRAFT(Baek et al., 2019) model training code☆50Updated last year
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆130Updated this week