Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
☆15Dec 27, 2023Updated 2 years ago
Alternatives and similar repositories for CSTBIR
Users that are interested in CSTBIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆29Oct 25, 2025Updated 5 months ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆40Apr 11, 2025Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated 2 years ago
- Flow Chart Image-to-Code Generation☆37Aug 13, 2023Updated 2 years ago
- Code and Dataset for FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context.☆22Jun 19, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval☆26Apr 5, 2026Updated last week
- The official repository of MM-R5☆28Jun 22, 2025Updated 9 months ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆26Jul 21, 2025Updated 8 months ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆196Jul 31, 2025Updated 8 months ago
- Collection of Composed Image Retrieval (CIR) papers.☆323Mar 27, 2026Updated 2 weeks ago
- This repo contains code for the paper "Compact Descriptors for Sketch-based Image Retrieval using a Triplet loss Convolutional Neural Net…☆15Nov 1, 2018Updated 7 years ago
- ☆11Nov 28, 2022Updated 3 years ago
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network☆24Dec 9, 2023Updated 2 years ago
- ☆27Feb 26, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICCV 2021] PyTorch implementation of "Universal Cross-Domain Retrieval: Generalizing across Classes and Domains"☆11Sep 26, 2021Updated 4 years ago
- The official code of "Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search"☆27Sep 15, 2025Updated 6 months ago
- A PyTorch implementation of ACNet based on TCSVT 2023 paper "ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image…☆11Dec 8, 2023Updated 2 years ago
- Composed Video Retrieval☆62May 2, 2024Updated last year
- 3D Face Alignment ---The 10th International Conference on Image and Graphics(ICIG2019)-Oral☆11Dec 3, 2019Updated 6 years ago
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆145Jan 5, 2026Updated 3 months ago
- Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels [CVPR 2023]☆14Sep 23, 2023Updated 2 years ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆25Feb 2, 2025Updated last year
- ☆39Jun 14, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆54May 27, 2025Updated 10 months ago
- [AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.☆16Jul 9, 2024Updated last year
- [ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts☆14Jan 13, 2025Updated last year
- ubuntu 系统下 GLM-4-Voice 部署经验分享