AndresPMD/Clip_CMR

Simple CLIP-based image-text matching baseline for COCO and Flickr30K (F30K)

☆15

Alternatives and similar repositories for Clip_CMR

Users who are interested in Clip_CMR are comparing it to the repositories listed below.

AndresPMD / semantic_adaptive_margin
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16 · Dec 10, 2021 · Updated 4 years ago
AndresPMD / Fine_Grained_Clf
Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
☆25 · Nov 15, 2021 · Updated 4 years ago
AndresPMD / StacMR
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24 · Sep 3, 2021 · Updated 4 years ago
AndresPMD / Pytorch-yolo-phoc
PyTorch implementation of the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13 · Dec 15, 2021 · Updated 4 years ago
furkanbiten / stvqa_amazon_ocr
STVQA and TextVQA OCR results from Amazon Text in Image pipeline
☆12 · Jul 18, 2022 · Updated 4 years ago
MCLAB-OCR / KnowledgeMiningWithSceneText
☆38 · Feb 4, 2023 · Updated 3 years ago
biswassanket / DocSegTr
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
☆59 · Sep 9, 2024 · Updated last year
AndresPMD / GCN_classification
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
☆65 · Dec 1, 2022 · Updated 3 years ago
quarrying / khandy
Handy Utilities for Computer Vision
☆12 · Updated this week
evanmiltenburg / MeasureDiversity
Measure the diversity of image descriptions, repository for our COLING 2018 paper.
☆13 · Dec 29, 2019 · Updated 6 years ago
wangfeng22 / RDH-planetext
Spatial-domain plaintext reversible data hiding
☆11 · Jul 6, 2020 · Updated 6 years ago
Xudangliatiger / APE-Loss
☆21 · Jul 6, 2022 · Updated 4 years ago
wangkai930418 / HCV_IIRC
Code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"
☆15 · Oct 28, 2022 · Updated 3 years ago
kodenii / Ref-Diff
Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models
☆21 · May 29, 2025 · Updated last year
Yvonneupup / DHNE
This repository contains an implementation of DHNE: a network representation learning method for dynamic heterogeneous networks.
☆10 · May 11, 2019 · Updated 7 years ago
ZixuanNi / Mod-X
Reproduction of the paper "Continual Vision-Language Representation Learning with Off-Diagonal Information" (Mod-X)
☆12 · Oct 31, 2023 · Updated 2 years ago
statusrank / A-Generic-Framework-for-Optimizing-Two-way-Partial-AUC
This is an official PyTorch code for our accepted paper "When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-w…
☆15 · Jul 7, 2022 · Updated 4 years ago
mugen-org / MUGEN_coinrun
A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …
☆13 · Jul 13, 2022 · Updated 4 years ago
Jielin-Qiu / MM_Robustness
[DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift
☆39 · Jan 25, 2024 · Updated 2 years ago
expectorlin / DR-Attacker
Code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)
☆10 · Jul 15, 2022 · Updated 4 years ago
furkanbiten / object-bias
Let there be a clock on the beach - WACV 2022
☆15 · Nov 15, 2021 · Updated 4 years ago
mailcorahul / auto_labeler
auto_labeler - An all-in-one library to automatically label vision data
☆22 · Jan 17, 2025 · Updated last year
gauravanand25 / cnn-convlstm-time-series
Inspired by the success and computational efficiency of convolutional architectures for various sequential tasks compared to recurrent ne…
☆19 · Jan 23, 2018 · Updated 8 years ago
amazon-science / textadain-robust-recognition
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21 · Jul 26, 2022 · Updated 3 years ago
zhaoyanpeng / vipant
VIsually-Pivoted Audio and(N) Text
☆22 · May 16, 2022 · Updated 4 years ago
meyerscetbon / LinearSinkhorn
☆17 · Oct 22, 2020 · Updated 5 years ago
f-rumblefish / Multi-Label-Fashion-MNIST
Multi-Label Classification and Class Activation Map on Fashion MNIST
☆11 · Mar 5, 2019 · Updated 7 years ago
volkancirik / groundnet
Repository for AAAI 2018 paper "Using Syntax for Referring Expression Recognition"
☆13 · Oct 7, 2020 · Updated 5 years ago
ubc-vision / TriBERT
Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…
☆14 · Dec 9, 2021 · Updated 4 years ago
liuaishan / SpatiotemporalAttack
☆13 · Dec 8, 2022 · Updated 3 years ago
expectorlin / ADAPT
Code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)
☆10 · Jul 17, 2022 · Updated 4 years ago
THU-TAI / OOD_Tutorial
A hands-on & simple tutorial for out-of-distribution generalization.
☆18 · Apr 23, 2022 · Updated 4 years ago
kaiw7 / STG-CMA
Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation
☆15 · Apr 13, 2024 · Updated 2 years ago
dimipapa / cookingprograms
Code and data for "Learning Program Representations for Food Images and Cooking Recipes" (oral at CVPR 2022)
☆15 · Mar 30, 2022 · Updated 4 years ago
google / mcic-coco
☆24 · Dec 22, 2016 · Updated 9 years ago
SooLab / Plain-Det
[ECCV 2024] The official PyTorch implementation of the "Plain-Det: A Plain Multi-Dataset Object Detector".
☆30 · Dec 8, 2024 · Updated last year
xieyxclack / factual_coco
The implementation of "Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation" in PyTorch.
☆17 · Nov 11, 2021 · Updated 4 years ago
lluisgomez / TextTopicNet
Self-supervised learning of visual features through embedding images into text topic spaces
☆95 · Aug 20, 2022 · Updated 3 years ago
NiccoloCavagnero / IncrementalLearning
PyTorch implementation of iCaRL with some extras.
☆16 · Nov 29, 2020 · Updated 5 years ago