AndresPMD/Pytorch-yolo-phoc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AndresPMD/Pytorch-yolo-phoc)

AndresPMD / Pytorch-yolo-phoc

Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval

☆13

Alternatives and similar repositories for Pytorch-yolo-phoc

Users that are interested in Pytorch-yolo-phoc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AndresPMD / Fine_Grained_Clf
View on GitHub
Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
☆25Nov 15, 2021Updated 4 years ago
AndresPMD / StacMR
View on GitHub
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24Sep 3, 2021Updated 4 years ago
AndresPMD / semantic_adaptive_margin
View on GitHub
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16Dec 10, 2021Updated 4 years ago
lluisgomez / single-shot-str
View on GitHub
Single Shot Scene Text Retrieval, ECCV 2018. L. Gomez*, A. Mafla*, M. Rusiñol, D. Karatzas.
☆68May 13, 2019Updated 7 years ago
AndresPMD / GCN_classification
View on GitHub
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
☆65Dec 1, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
AndresPMD / Clip_CMR
View on GitHub
CLIP-based simple image-text matching baseline for COCO and F30K
☆15Sep 16, 2021Updated 4 years ago
lanfeng4659 / STR-TDSL
View on GitHub
☆82Jun 29, 2023Updated 3 years ago
furkanbiten / object-bias
View on GitHub
Let there be clock in the beach - WACV 2022
☆15Nov 15, 2021Updated 4 years ago
dali92002 / HTRbyMatching
View on GitHub
Hadwritten Text Recognition in Few-shot Scenario
☆22Mar 25, 2023Updated 3 years ago
furkanbiten / stvqa_amazon_ocr
View on GitHub
STVQA and TextVQA OCR results from Amazon Text in Image pipeline
☆12Jul 18, 2022Updated 4 years ago
furkanbiten / SelectiveTextStyleTransfer
View on GitHub
ICDAR 2019
☆25Aug 2, 2019Updated 6 years ago
biswassanket / DocSegTr
View on GitHub
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
☆59Sep 9, 2024Updated last year
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
furkanbiten / GoodNews
View on GitHub
Good News Everyone! - CVPR 2019
☆130Apr 14, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ayanban011 / SVGCraft
View on GitHub
[WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
☆24Oct 11, 2025Updated 9 months ago
ahmedssabir / Textual-Visual-Semantic-Dataset-for-Text-Spotting
View on GitHub
Textual Visual Semantic Dataset for Text Spotting. CVPRW 2020
☆12Jul 2, 2022Updated 4 years ago
lluisgomez / TextTopicNet
View on GitHub
Self-supervised learning of visual features through embedding images into text topic spaces
☆95Aug 20, 2022Updated 3 years ago
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 3 years ago
biswassanket / synth_doc_generation
View on GitHub
Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021
☆93Jul 16, 2021Updated 5 years ago
weijiawu / TransVTSpotter
View on GitHub
A new video text spotting framework with Transformer
☆82May 23, 2022Updated 4 years ago
dali92002 / SSL-OCR
View on GitHub
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30Jul 12, 2023Updated 3 years ago
dali92002 / DocEnTR
View on GitHub
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
☆190Jan 17, 2025Updated last year
sounakdey / SigNet
View on GitHub
SigNet: Convolutional Siamese Network for Writer Independent Offline Signature Verification
☆80Oct 24, 2017Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
evanmiltenburg / MeasureDiversity
View on GitHub
Measure the diversity of image descriptions, repository for our COLING 2018 paper.
☆13Dec 29, 2019Updated 6 years ago
aiintelligentsystems / next-level-bert
View on GitHub
☆16Jun 14, 2024Updated 2 years ago
ayanban011 / GraphKD
View on GitHub
[ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
☆16Sep 6, 2024Updated last year
wangkai930418 / HCV_IIRC
View on GitHub
code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"
☆15Oct 28, 2022Updated 3 years ago
hecoding / Hyper-Modulation
View on GitHub
Official Implementation for "Transferring Unconditional to Conditional GANs with Hyper-Modulation" CVPRW 22 https://arxiv.org/abs/2112.02…
☆13Jun 28, 2022Updated 4 years ago
frostinassiky / bsp
View on GitHub
Placeholder for code of BSP.
☆11Aug 13, 2021Updated 4 years ago
MCLAB-OCR / KnowledgeMiningWithSceneText
View on GitHub
☆38Feb 4, 2023Updated 3 years ago
amazon-science / semimtr-text-recognition
View on GitHub
Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)
☆83Sep 12, 2023Updated 2 years ago
showlab / DemoVLP
View on GitHub
[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training
☆22Mar 19, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mineshmathew / DocVQA
View on GitHub
baselines for DocVQA dataset
☆21Apr 11, 2021Updated 5 years ago
cychomatica / One-Pixel-Shotcut
View on GitHub
One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)
☆14Sep 28, 2025Updated 9 months ago
dayu11 / Availability-Attacks-Create-Shortcuts
View on GitHub
☆10Jul 28, 2022Updated 3 years ago
dali92002 / DE-GAN
View on GitHub
Document Image Enhancement with GANs - TPAMI journal
☆222Mar 24, 2023Updated 3 years ago
mailcorahul / auto_labeler
View on GitHub
auto_labeler - An all-in-one library to automatically label vision data
☆22Jan 17, 2025Updated last year
rxtan2 / video-grounding-narrations
View on GitHub
☆12Mar 12, 2023Updated 3 years ago
snap-research / CoSearch
View on GitHub
CoSearch: Joint Training of Reasoning and Document Ranking via Reinforcement Learning for Agentic Search
☆15Apr 28, 2026Updated 2 months ago