Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13Dec 15, 2021Updated 4 years ago
Alternatives and similar repositories for Pytorch-yolo-phoc
Users that are interested in Pytorch-yolo-phoc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- Scene Text Aware Cross Modal Retrieval (StacMR)☆24Sep 3, 2021Updated 4 years ago
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Dec 10, 2021Updated 4 years ago
- Single Shot Scene Text Retrieval, ECCV 2018. L. Gomez*, A. Mafla*, M. Rusiñol, D. Karatzas.☆68May 13, 2019Updated 6 years ago
- Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval☆64Dec 1, 2022Updated 3 years ago
- ☆82Jun 29, 2023Updated 2 years ago
- Hadwritten Text Recognition in Few-shot Scenario☆22Mar 25, 2023Updated 2 years ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆12Jul 18, 2022Updated 3 years ago
- ICDAR 2019☆25Aug 2, 2019Updated 6 years ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆59Sep 9, 2024Updated last year
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Aug 20, 2022Updated 3 years ago
- Good News Everyone! - CVPR 2019☆128Apr 14, 2022Updated 3 years ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆23Oct 11, 2025Updated 5 months ago
- Self-supervised learning of visual features through embedding images into text topic spaces☆94Aug 20, 2022Updated 3 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- A new video text spotting framework with Transformer☆81May 23, 2022Updated 3 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Jul 12, 2023Updated 2 years ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆186Jan 17, 2025Updated last year
- SigNet: Convolutional Siamese Network for Writer Independent Offline Signature Verification☆80Oct 24, 2017Updated 8 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Dec 29, 2019Updated 6 years ago
- ☆16Jun 14, 2024Updated last year
- code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"☆15Oct 28, 2022Updated 3 years ago
- Tools to estimate the correlation of different text-based evaluation measures for Automatic Image Description☆10Feb 2, 2017Updated 9 years ago
- baselines for DocVQA dataset☆21Apr 11, 2021Updated 4 years ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- ☆38Feb 4, 2023Updated 3 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Sep 12, 2023Updated 2 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆22Mar 19, 2022Updated 4 years ago
- One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)☆14Sep 28, 2025Updated 5 months ago
- ☆10Jul 28, 2022Updated 3 years ago
- [NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries☆12Apr 15, 2022Updated 3 years ago
- Document Image Enhancement with GANs - TPAMI journal☆217Mar 24, 2023Updated 2 years ago
- ☆12Mar 12, 2023Updated 3 years ago
- auto_labeler - An all-in-one library to automatically label vision data☆19Jan 17, 2025Updated last year
- Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression☆14Mar 22, 2025Updated last year
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- Who's Waldo? Linking People Across Text and Images. ICCV 2021.☆13May 17, 2023Updated 2 years ago
- ☆11Sep 16, 2021Updated 4 years ago
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.☆28Jul 6, 2022Updated 3 years ago