Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
☆15Dec 27, 2023Updated 2 years ago
Alternatives and similar repositories for CSTBIR
Users that are interested in CSTBIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Oct 21, 2022Updated 3 years ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆40Apr 11, 2025Updated last year
- ☆19Jul 28, 2025Updated 9 months ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- Code and Dataset for FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context.☆22Jun 19, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval☆26Apr 9, 2026Updated 3 weeks ago
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆86Jul 4, 2024Updated last year
- A simple Sketch-Based Image Retrieval web application, implemented by Flask and PyTorch.☆33May 13, 2020Updated 5 years ago
- The official repository of MM-R5☆29Jun 22, 2025Updated 10 months ago
- Open Vocabulary Semantic Scene Sketch Understanding☆29Jul 1, 2024Updated last year
- The implementation of FINER-MLLM, which is accepted by MM2024.☆18Oct 8, 2024Updated last year
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆197Jul 31, 2025Updated 9 months ago
- Collection of Composed Image Retrieval (CIR) papers.☆333Apr 27, 2026Updated last week
- This repo contains code for the paper "Compact Descriptors for Sketch-based Image Retrieval using a Triplet loss Convolutional Neural Net…☆15Nov 1, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of TC-Net for iSBIR: Triplet Classification Network for instance-level Sketch Based Image Retrieval.☆21Feb 23, 2020Updated 6 years ago
- The official code of "Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search"☆27Sep 15, 2025Updated 7 months ago
- A PyTorch implementation of ACNet based on TCSVT 2023 paper "ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image…☆11Dec 8, 2023Updated 2 years ago
- Composed Video Retrieval☆62May 2, 2024Updated 2 years ago
- Fast Semantic Segmentation Image Annotation with Segment Anything Model (SAM)☆14Mar 23, 2024Updated 2 years ago
- 3D Face Alignment ---The 10th International Conference on Image and Graphics(ICIG2019)-Oral☆11Dec 3, 2019Updated 6 years ago
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆145Jan 5, 2026Updated 4 months ago
- Adversarial detection and defense for deep learning systems using robust feature alignment☆17Nov 10, 2020Updated 5 years ago
- CVPR2023: Few-Shot Learning with Visual Distribution Calibration and Cross-Modal Distribution Alignment☆15May 19, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆54May 27, 2025Updated 11 months ago
- ☆25May 8, 2025Updated 11 months ago
- [AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.☆16Jul 9, 2024Updated last year
- [ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts☆14Jan 13, 2025Updated last year
- A PyTorch implementation for video style transfer☆16Jan 8, 2020Updated 6 years ago
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆18Oct 31, 2024Updated last year
- Repo of NeurIPS23☆18Oct 25, 2023Updated 2 years ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023☆17Jul 24, 2023Updated 2 years ago
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆279Mar 26, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).☆15Sep 25, 2025Updated 7 months ago
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆91Jul 13, 2024Updated last year
- Official Repository for ICML 2024 Paper "OT-CLIP: Understanding and Generalizing CLIP via Optimal Transport"☆24Dec 4, 2025Updated 5 months ago
- ZSE-SBIR☆55Oct 25, 2023Updated 2 years ago
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆87Aug 6, 2025Updated 8 months ago
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆24Feb 9, 2024Updated 2 years ago