Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
☆15Dec 27, 2023Updated 2 years ago
Alternatives and similar repositories for CSTBIR
Users that are interested in CSTBIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆29Oct 25, 2025Updated 8 months ago
- A PyTorch implementation of ClipPrompt based on CVPR 2023 paper "CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained…☆18Nov 5, 2023Updated 2 years ago
- Project page for the paper 'CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not'☆79Aug 6, 2023Updated 2 years ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆41Apr 11, 2025Updated last year
- Official Implementation of Few-shot Visual Relationship Co-localization☆25Aug 25, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated 2 years ago
- Flow Chart Image-to-Code Generation☆37Aug 13, 2023Updated 2 years ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- Comprehensive Scene Text Recognition Toolkit across 11 Indian Languages☆51Jun 27, 2026Updated last week
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆88Jul 4, 2024Updated 2 years ago
- The official repository of MM-R5☆29Jun 22, 2025Updated last year
- Open Vocabulary Semantic Scene Sketch Understanding☆27Jul 1, 2024Updated 2 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆26Jul 21, 2025Updated 11 months ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆198Jul 31, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repo contains code for the paper "Compact Descriptors for Sketch-based Image Retrieval using a Triplet loss Convolutional Neural Net…☆15Nov 1, 2018Updated 7 years ago
- Implementation of TC-Net for iSBIR: Triplet Classification Network for instance-level Sketch Based Image Retrieval.☆21Feb 23, 2020Updated 6 years ago
- ☆197May 9, 2026Updated last month
- ☆11Nov 28, 2022Updated 3 years ago
- 基于STM32的指纹锁设计,可以实现指纹识别和输出信号。硬件上用的STM32F103C8T6,AS608。☆11Jul 29, 2022Updated 3 years ago
- The official code of "Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search"☆32Sep 15, 2025Updated 9 months ago
- [ICCV 2021] PyTorch implementation of "Universal Cross-Domain Retrieval: Generalizing across Classes and Domains"☆11Sep 26, 2021Updated 4 years ago
- ME-GraphAU on Video☆11May 10, 2024Updated 2 years ago
- Composed Video Retrieval☆62May 2, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fast Semantic Segmentation Image Annotation with Segment Anything Model (SAM)☆14Mar 23, 2024Updated 2 years ago
- ☆13Jul 1, 2024Updated 2 years ago
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆148Jan 5, 2026Updated 5 months ago
- CVPR2023: Few-Shot Learning with Visual Distribution Calibration and Cross-Modal Distribution Alignment☆14May 19, 2023Updated 3 years ago
- ☆42Jun 14, 2025Updated last year
- [AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.☆15Jul 9, 2024Updated last year
- [ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts☆13Jan 13, 2025Updated last year
- A PyTorch implementation for video style transfer☆16Jan 8, 2020Updated 6 years ago
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆20Feb 26, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆18Oct 31, 2024Updated last year
- Repo of NeurIPS23☆17Oct 25, 2023Updated 2 years ago
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆284Mar 26, 2025Updated last year
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆91Jul 13, 2024Updated last year
- ☆12Mar 28, 2024Updated 2 years ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆35Feb 22, 2026Updated 4 months ago
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year