vl2g/CSTBIR

Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions

☆15

Alternatives and similar repositories for CSTBIR

Users who are interested in CSTBIR are comparing it to the repositories listed below.

vl2g / Sketch-Inpainting
☆29 · Oct 25, 2025 · Updated 8 months ago
leftthomas / ClipPrompt
A PyTorch implementation of ClipPrompt based on CVPR 2023 paper "CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained…
☆18 · Nov 5, 2023 · Updated 2 years ago
aneeshan95 / Sketch_LVM
Project page for the paper 'CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not'
☆78 · Aug 6, 2023 · Updated 2 years ago
Abhiram4572 / mi_bart
☆13 · Oct 23, 2024 · Updated last year
vl2g / MATR
Official Implementation of Moment Alignment Transformer
☆16 · Oct 18, 2025 · Updated 9 months ago
vl2g / VRC
Official Implementation of Few-shot Visual Relationship Co-localization
☆25 · Aug 25, 2021 · Updated 4 years ago
icq-benchmark / icq-benchmark
☆19 · Jul 28, 2025 · Updated 11 months ago
vl2g / MPA
Implementation of Model Parity Alignment
☆20 · Nov 19, 2025 · Updated 8 months ago
pinakinathc / fscoco
Code and Dataset for FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context.
☆21 · Jun 19, 2023 · Updated 3 years ago
ExplainableML / Vision_by_Language
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
☆89 · Jul 4, 2024 · Updated 2 years ago
CSfufu / VidSketch
We propose VidSketch, the first method capable of generating high-quality video animations directly from any number of hand-drawn sketche…
☆23 · Jun 10, 2025 · Updated last year
AhmedBourouis / Scene-Sketch-Segmentation
Open Vocabulary Semantic Scene Sketch Understanding
☆27 · Jul 1, 2024 · Updated 2 years ago
xianzhangzx / FINER-MLLM
The implementation of FINER-MLLM, which is accepted by MM2024.
☆18 · Oct 8, 2024 · Updated last year
GUET-PDK / pdk-mini
GUET Paodekuai WeChat mini program: a campus errand-running system (software engineering course project, class of 2020)
☆14 · Jun 20, 2023 · Updated 3 years ago
Qinguohui / -STM32-
An STM32-based fingerprint lock design that performs fingerprint recognition and outputs a signal. The hardware used is the STM32F103C8T6 and the AS608.
☆11 · Jul 29, 2022 · Updated 3 years ago
vec-ai / wikiHow-TIIR
[ACL 2025] Towards Text-Image Interleaved Retrieval
☆16 · Sep 3, 2025 · Updated 10 months ago
PyJulie / MONICA
☆27 · Apr 23, 2026 · Updated 3 months ago
Beichen1996 / SRAAL
State-Relabeling Adversarial Active Learning
☆14 · Aug 17, 2021 · Updated 4 years ago
OmkarThawakar / composed-video-retrieval
Composed Video Retrieval
☆62 · May 2, 2024 · Updated 2 years ago
fpv-iplab / stillfast
Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…
☆14 · Apr 11, 2023 · Updated 3 years ago
wudi-ldd / SAM-Annotation
Fast Semantic Segmentation Image Annotation with Segment Anything Model (SAM)
☆14 · Mar 23, 2024 · Updated 2 years ago
uestc-xyh / ComqueryFormer
☆11 · Nov 28, 2022 · Updated 3 years ago
hrtang22 / MUSE
Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"
☆26 · Feb 2, 2025 · Updated last year
pritamqu / XKD
[AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.
☆15 · Jul 9, 2024 · Updated 2 years ago
iLearn-Lab / TOIS25-Awesome-Composed-Image-Retrieval
Collection of Composed Image Retrieval (CIR) papers.
☆360 · Jun 8, 2026 · Updated last month
Code2Q / TagCF
☆17 · Nov 6, 2025 · Updated 8 months ago
wds2014 / ALIGN
Repo of NeurIPS23
☆17 · Oct 25, 2023 · Updated 2 years ago
EnchanterXiao / video-style-transfer
A PyTorch implementation for video style transfer
☆16 · Jan 8, 2020 · Updated 6 years ago
yaoweilee / PMF
Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023
☆16 · Jul 24, 2023 · Updated 3 years ago
anosorae / IRRA
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)
☆285 · Mar 26, 2025 · Updated last year
hhc1997 / MSCN
☆12 · Mar 28, 2024 · Updated 2 years ago
MPI-Lab / MLLM4Text-ReID
Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)
☆91 · Jul 13, 2024 · Updated 2 years ago
dhg-wei / MCL
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
☆28 · Sep 27, 2024 · Updated last year
WangWenhao0716 / TransHP
[NeurIPS 2023] The official implementation of "TransHP: Image Classification with Hierarchical Prompting"
☆20 · Dec 9, 2023 · Updated 2 years ago
HeartbreakSurvivor / ETR
ETR: An Efficient Transformer for Re-ranking in Visual Place Recognition (WACV 2023)
☆17 · Nov 10, 2022 · Updated 3 years ago
Cuberick-Orion / CIRPLANT
Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…
☆39 · Jun 26, 2024 · Updated 2 years ago
miccunifi / CIRCO
[ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset
☆87 · Aug 6, 2025 · Updated 11 months ago
WinKawaks / SketchDreamer
[BMVC 2023 (Oral)] SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation
☆28 · Jun 8, 2025 · Updated last year
hackerchenzhuo / LaKo
[Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection
☆24 · Feb 9, 2024 · Updated 2 years ago