Scene Text Aware Cross Modal Retrieval (StacMR)
☆24Sep 3, 2021Updated 4 years ago
Alternatives and similar repositories for StacMR
Users that are interested in StacMR are comparing it to the libraries listed below
Sorting:
- Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval☆13Dec 15, 2021Updated 4 years ago
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Dec 10, 2021Updated 4 years ago
- Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval☆64Dec 1, 2022Updated 3 years ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆11Jul 18, 2022Updated 3 years ago
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- CLIP-based simple image-text matching baseline for COCO and F30K☆14Sep 16, 2021Updated 4 years ago
- ☆38Feb 4, 2023Updated 3 years ago
- ☆82Jun 29, 2023Updated 2 years ago
- Hadwritten Text Recognition in Few-shot Scenario☆22Mar 25, 2023Updated 2 years ago
- ☆25May 11, 2022Updated 3 years ago
- ICDAR 2019☆25Aug 2, 2019Updated 6 years ago
- The imdb files with SBD-Trans OCR for TextVQA dataset.☆11Nov 30, 2021Updated 4 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- Unconstrained Text Detection with Box Supervisionand Dynamic Self-Training☆33Nov 24, 2022Updated 3 years ago
- ☆30May 7, 2021Updated 4 years ago
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29May 26, 2022Updated 3 years ago
- Let there be clock in the beach - WACV 2022☆15Nov 15, 2021Updated 4 years ago
- ☆14May 26, 2023Updated 2 years ago
- ☆16Oct 17, 2024Updated last year
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆59Sep 9, 2024Updated last year
- Single Shot Scene Text Retrieval, ECCV 2018. L. Gomez*, A. Mafla*, M. Rusiñol, D. Karatzas.☆68May 13, 2019Updated 6 years ago
- Good News Everyone! - CVPR 2019☆128Apr 14, 2022Updated 3 years ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting☆68Oct 9, 2023Updated 2 years ago
- Source code of our TCSVT 2017 paper "SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval"☆15May 29, 2019Updated 6 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- ☆15Apr 30, 2022Updated 3 years ago
- HHH☆36May 2, 2022Updated 3 years ago
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆106Mar 28, 2024Updated last year
- ☆13Feb 1, 2022Updated 4 years ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Jun 16, 2023Updated 2 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆40Jun 26, 2024Updated last year
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Dec 2, 2022Updated 3 years ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆22Oct 11, 2025Updated 4 months ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆186Jan 17, 2025Updated last year
- [ACM MM 2020] Exploring Font-independent Features for Scene Text Recognition☆44Nov 30, 2020Updated 5 years ago
- [ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation☆19Jul 18, 2022Updated 3 years ago
- A new video text spotting framework with Transformer☆78May 23, 2022Updated 3 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆22Mar 19, 2022Updated 3 years ago
- Demo code for DeepText (ICASSP 2017)☆22Jun 24, 2017Updated 8 years ago