Scene Text Aware Cross Modal Retrieval (StacMR)
☆24Sep 3, 2021Updated 4 years ago
Alternatives and similar repositories for StacMR
Users that are interested in StacMR are comparing it to the libraries listed below.
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Dec 10, 2021Updated 4 years ago
- PyTorch implementation of the ECCV 2018 paper - Single Shot Scene Text Retrieval☆13Dec 15, 2021Updated 4 years ago
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval☆64Dec 1, 2022Updated 3 years ago
- CLIP-based simple image-text matching baseline for COCO and F30K☆14Sep 16, 2021Updated 4 years ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆12Jul 18, 2022Updated 3 years ago
- ☆38Feb 4, 2023Updated 3 years ago
- Let there be clock in the beach - WACV 2022☆15Nov 15, 2021Updated 4 years ago
- ☆82Jun 29, 2023Updated 2 years ago
- ICDAR 2019☆25Aug 2, 2019Updated 6 years ago
- Handwritten Text Recognition in Few-shot Scenario☆22Mar 25, 2023Updated 2 years ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆59Sep 9, 2024Updated last year
- ☆30May 7, 2021Updated 4 years ago
- Good News Everyone! - CVPR 2019☆128Apr 14, 2022Updated 3 years ago
- ☆25May 11, 2022Updated 3 years ago
- Single Shot Scene Text Retrieval, ECCV 2018. L. Gomez*, A. Mafla*, M. Rusiñol, D. Karatzas.☆68May 13, 2019Updated 6 years ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆22Oct 11, 2025Updated 5 months ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆186Jan 17, 2025Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- Document Image Enhancement with GANs - TPAMI journal☆217Mar 24, 2023Updated 2 years ago
- The imdb files with SBD-Trans OCR for TextVQA dataset.☆11Nov 30, 2021Updated 4 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆92Jul 16, 2021Updated 4 years ago
- Unconstrained Text Detection with Box Supervision and Dynamic Self-Training☆34Nov 24, 2022Updated 3 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Jul 12, 2023Updated 2 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆40Jun 26, 2024Updated last year
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval☆129Feb 26, 2020Updated 6 years ago
- SigNet: Convolutional Siamese Network for Writer Independent Offline Signature Verification☆80Oct 24, 2017Updated 8 years ago
- ☆14May 26, 2023Updated 2 years ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Jun 16, 2023Updated 2 years ago
- [ECCV 2022] Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation☆19Jul 18, 2022Updated 3 years ago
- A modular framework for Visual Question Answering research by the FAIR A-STAR team☆45Aug 26, 2021Updated 4 years ago
- Doodle to Search: Practical Zero Shot Sketch Based Image Retrieval☆80Sep 30, 2022Updated 3 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Dec 29, 2019Updated 6 years ago
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆47Aug 19, 2021Updated 4 years ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting☆70Oct 9, 2023Updated 2 years ago
- Self-supervised learning of visual features through embedding images into text topic spaces☆94Aug 20, 2022Updated 3 years ago
- ☆16Jun 14, 2024Updated last year