MCLAB-OCR/KnowledgeMiningWithSceneText

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MCLAB-OCR/KnowledgeMiningWithSceneText)

MCLAB-OCR / KnowledgeMiningWithSceneText

☆38

Alternatives and similar repositories for KnowledgeMiningWithSceneText

Users that are interested in KnowledgeMiningWithSceneText are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AndresPMD / Fine_Grained_Clf
View on GitHub
Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
☆25Nov 15, 2021Updated 4 years ago
AndresPMD / StacMR
View on GitHub
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24Sep 3, 2021Updated 4 years ago
AndresPMD / GCN_classification
View on GitHub
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
☆65Dec 1, 2022Updated 3 years ago
AndresPMD / Clip_CMR
View on GitHub
CLIP-based simple image-text matching baseline for COCO and F30K
☆15Sep 16, 2021Updated 4 years ago
furkanbiten / stvqa_amazon_ocr
View on GitHub
STVQA and TextVQA OCR results from Amazon Text in Image pipeline
☆12Jul 18, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wangkai930418 / HCV_IIRC
View on GitHub
code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"
☆15Oct 28, 2022Updated 3 years ago
TencentARC / BTS
View on GitHub
BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
☆33Apr 16, 2024Updated 2 years ago
ayumiymk / DiG
View on GitHub
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆74Feb 27, 2023Updated 3 years ago
lanfeng4659 / STR-TDSL
View on GitHub
☆82Jun 29, 2023Updated 3 years ago
marcopede / AreasOfAttention
View on GitHub
☆10Apr 20, 2018Updated 8 years ago
microsoft / TAP
View on GitHub
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)
☆72May 22, 2023Updated 3 years ago
shannanyinxiang / SPTS
View on GitHub
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆145Jul 26, 2023Updated 2 years ago
furkanbiten / object-bias
View on GitHub
Let there be clock in the beach - WACV 2022
☆15Nov 15, 2021Updated 4 years ago
doc-analysis / DocBankLoader
View on GitHub
DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.
☆24Mar 17, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AndresPMD / Pytorch-yolo-phoc
View on GitHub
Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13Dec 15, 2021Updated 4 years ago
whai362 / pan_pp_stable
View on GitHub
☆27Oct 9, 2022Updated 3 years ago
Canjie-Luo / Real-300K
View on GitHub
The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…
☆34Jun 21, 2022Updated 4 years ago
LARS-research / TREFE
View on GitHub
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13Nov 25, 2022Updated 3 years ago
ChenyuGAO-CS / SMA
View on GitHub
The imdb files with SBD-Trans OCR for TextVQA dataset.
☆11Nov 30, 2021Updated 4 years ago
phiphiphi31 / DualTFR
View on GitHub
☆18Jun 26, 2023Updated 3 years ago
xinke-wang / Awesome-Text-VQA
View on GitHub
☆188May 8, 2024Updated 2 years ago
HqiTao / SplatHLoc
View on GitHub
Official code for CVPR 2026 paper "Hierarchical Visual Relocalization with Nearest View Synthesis from Feature Gaussian Splatting"
☆19May 1, 2026Updated 2 months ago
uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆56Oct 30, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shengtao96 / CentripetalText
View on GitHub
☆29Aug 31, 2022Updated 3 years ago
fawazsammani / look-and-modify
View on GitHub
Look and Modify: Modification Networks for Image Captioning, BMVC 2019
☆21Feb 18, 2020Updated 6 years ago
mengqiDyangge / HierKD
View on GitHub
☆39Aug 25, 2022Updated 3 years ago
usydnlp / vdoc
View on GitHub
☆15Sep 7, 2022Updated 3 years ago
weijiawu / TransDETR
View on GitHub
[IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer
☆114Mar 28, 2024Updated 2 years ago
xmu-xiaoma666 / SDATR
View on GitHub
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
☆19Oct 15, 2022Updated 3 years ago
aurooj / WSG-VQA-VLTransformers
View on GitHub
Weakly Supervised Grounding for VQA in Vision-Language Transformers
☆16May 6, 2023Updated 3 years ago
wenwenyu / TCM
View on GitHub
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆202Jun 17, 2024Updated 2 years ago
lpcvai / 21LPCVC-UAV_VIdeo_Track-Sample-Solution
View on GitHub
☆25Oct 10, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Gitsamshi / WeakVRD-Captioning
View on GitHub
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆33Sep 15, 2020Updated 5 years ago
joanrod / ocr-vqgan
View on GitHub
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…
☆84Jan 30, 2023Updated 3 years ago
zhaominyiz / STIRER
View on GitHub
STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023
☆14Dec 2, 2024Updated last year
Actasidiot / EFIFSTR
View on GitHub
[ACM MM 2020] Exploring Font-independent Features for Scene Text Recognition
☆44Nov 30, 2020Updated 5 years ago
Yochengliu / MLIC-KD-WSD
View on GitHub
Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection (ACM MM 2018)
☆59Sep 30, 2021Updated 4 years ago
Yvonneupup / DHNE
View on GitHub
This repository contains implementation of DHNE : Network Representation Learning Method for Dynamic Heterogeneous Network.
☆10May 11, 2019Updated 7 years ago
AndresPMD / semantic_adaptive_margin
View on GitHub
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16Dec 10, 2021Updated 4 years ago