thinh-re / s-multimaeLinks
[ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection
☆12Updated last year
Alternatives and similar repositories for s-multimae
Users that are interested in s-multimae are comparing it to the libraries listed below
Sorting:
- Dictionary-guided Scene Text Recognition (CVPR-2021)☆152Updated last year
- Vietnamese handwritten text recognition system☆17Updated 4 years ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆69Updated last year
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆73Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆139Updated last year
- a dataset for camera-based table detection☆16Updated 4 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Updated 2 years ago
- https://dl.acm.org/doi/10.1145/3657281☆97Updated last year
- PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)☆313Updated last year
- A curated list of papers about key information extraction.☆104Updated last year
- A collection of OCR-related datasets☆205Updated 3 years ago
- ☆62Updated 2 years ago
- Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labelin…☆266Updated last year
- Create TensorRT-runtime for vietocr☆12Updated 4 years ago
- CVPR 2022: Table Structure Recognition☆40Updated 3 years ago
- ☆18Updated last year
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆30Updated last year
- Hadwritten Text Recognition in Few-shot Scenario☆22Updated 2 years ago
- Scene text recognition☆108Updated 3 years ago
- ☆89Updated 11 months ago
- ☆78Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- ☆19Updated 2 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆63Updated last year
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆182Updated 4 years ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression☆66Updated 11 months ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 3 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Updated 3 years ago
- CRAFT(Baek et al., 2019) model training code☆51Updated last year
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Updated last year