AndresPMD/semantic_adaptive_margin

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AndresPMD/semantic_adaptive_margin)

AndresPMD / semantic_adaptive_margin

WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

☆16

Alternatives and similar repositories for semantic_adaptive_margin

Users that are interested in semantic_adaptive_margin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AndresPMD / StacMR
View on GitHub
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24Sep 3, 2021Updated 4 years ago
AndresPMD / Pytorch-yolo-phoc
View on GitHub
Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13Dec 15, 2021Updated 4 years ago
furkanbiten / object-bias
View on GitHub
Let there be clock in the beach - WACV 2022
☆15Nov 15, 2021Updated 4 years ago
hardyqr / HAL
View on GitHub
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38Oct 4, 2023Updated 2 years ago
AndresPMD / Fine_Grained_Clf
View on GitHub
Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
☆25Nov 15, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AndresPMD / Clip_CMR
View on GitHub
CLIP-based simple image-text matching baseline for COCO and F30K
☆15Sep 16, 2021Updated 4 years ago
ayanban011 / SVGCraft
View on GitHub
[WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
☆24Oct 11, 2025Updated 9 months ago
biswassanket / DocSegTr
View on GitHub
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
☆59Sep 9, 2024Updated last year
ZenglaiMa / image-text-matching-paper-reading
View on GitHub
Paper reading notes in the field of Image-Text Matching/Retrieval.
☆13Mar 25, 2022Updated 4 years ago
yiling2018 / saem
View on GitHub
Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019
☆41Sep 24, 2019Updated 6 years ago
dali92002 / HTRbyMatching
View on GitHub
Hadwritten Text Recognition in Few-shot Scenario
☆22Mar 25, 2023Updated 3 years ago
ayanban011 / GraphKD
View on GitHub
[ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
☆16Sep 6, 2024Updated last year
CrossmodalGroup / BFAN
View on GitHub
Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
☆39Jun 19, 2023Updated 3 years ago
furkanbiten / stvqa_amazon_ocr
View on GitHub
STVQA and TextVQA OCR results from Amazon Text in Image pipeline
☆12Jul 18, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sunnychencool / AOQ
View on GitHub
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34Jul 2, 2020Updated 6 years ago
AndresPMD / GCN_classification
View on GitHub
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
☆65Dec 1, 2022Updated 3 years ago
kywen1119 / DSRAN
View on GitHub
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
☆74Oct 25, 2022Updated 3 years ago
biswassanket / synth_doc_generation
View on GitHub
Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021
☆93Jul 16, 2021Updated 5 years ago
dali92002 / OCR-TR
View on GitHub
Optocal Character Recognition (OCR / HTR) using Transformers
☆11Aug 20, 2022Updated 3 years ago
mesnico / ALADIN
View on GitHub
Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
☆28Dec 6, 2023Updated 2 years ago
BruceW91 / CVSE
View on GitHub
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
☆168Feb 7, 2022Updated 4 years ago
cyh-sj / CGMN
View on GitHub
The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…
☆45Jun 5, 2023Updated 3 years ago
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
furkanbiten / SelectiveTextStyleTransfer
View on GitHub
ICDAR 2019
☆25Aug 2, 2019Updated 6 years ago
LgQu / CAMERA
View on GitHub
Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20
☆29May 26, 2022Updated 4 years ago
dali92002 / DocEnTR
View on GitHub
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
☆190Jan 17, 2025Updated last year
mesnico / TERN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144
☆58Dec 6, 2023Updated 2 years ago
Shiyang-Yan / Discrete-continous-PG-for-Retrieval
View on GitHub
☆13Feb 1, 2022Updated 4 years ago
CrossmodalGroup / CMCAN
View on GitHub
Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
☆36Jun 16, 2023Updated 3 years ago
WangFei-2019 / Image-text-Retrieval
View on GitHub
☆47Jan 14, 2026Updated 6 months ago
andreineculai / MPC
View on GitHub
☆25May 11, 2022Updated 4 years ago
dali92002 / DE-GAN
View on GitHub
Document Image Enhancement with GANs - TPAMI journal
☆222Mar 24, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
emanuelevivoli / awesome-comics-understanding
View on GitHub
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆139Jan 2, 2025Updated last year
dali92002 / SSL-OCR
View on GitHub
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30Jul 12, 2023Updated 3 years ago
penghu-cs / MRL
View on GitHub
Learning Cross-Modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)
☆56Mar 5, 2023Updated 3 years ago
Roc-Ng / HANet
View on GitHub
PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).
☆47Aug 19, 2021Updated 4 years ago
iLearn-Lab / SIGIR21-DIME
View on GitHub
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21
☆70Apr 5, 2026Updated 3 months ago
XLearning-SCU / 2021-NeurIPS-NCR
View on GitHub
☆82Nov 6, 2023Updated 2 years ago
niluthpol / multimodal_vtt
View on GitHub
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
☆68Apr 10, 2020Updated 6 years ago