[WWW 2025 Oral] ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning
☆21Jul 2, 2025Updated 11 months ago
Alternatives and similar repositories for ImageScope
Users that are interested in ImageScope are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jun 11, 2025Updated last year
- [ICCV 2025] Official PyTorch Code for "Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval"☆18Aug 23, 2025Updated 9 months ago
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- ☆15Sep 30, 2024Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆87Aug 6, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Feb 25, 2025Updated last year
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆15Jun 6, 2024Updated 2 years ago
- Computer-Use Agents as Judges for Generative UI☆45Nov 27, 2025Updated 6 months ago
- export any your YOLOv7 model to TensorFlow, TensorFlowJs, ONNX, OpenVINO, RKNN,...☆14Feb 7, 2025Updated last year
- [ACL-main-2026]We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Mode…☆28Jan 27, 2026Updated 4 months ago
- This repository is an implementation of object detection to detect waste in real-time directly on the browser using the TFJS-TFLite Web A…☆17Feb 6, 2024Updated 2 years ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆17Apr 27, 2024Updated 2 years ago
- vue+elementUI 创建的一个好看的UI页面。暂时无js代码,只作为UI展示。☆11Feb 4, 2023Updated 3 years ago
- Final Solution for MAG240M dataset of OGB-LSC@KDDCUP2021☆11Jun 17, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- CIFAR-100 dataset by classes folder☆11Nov 7, 2024Updated last year
- A modular and stable agent sandbox runtime environment.☆53May 25, 2026Updated 2 weeks ago
- ☆12Jun 23, 2023Updated 2 years ago
- Cross-Modal Retrieval with Partially Mismatched Pairs (IEEE TPAMI 2023, PyTorch Code)☆23Sep 17, 2023Updated 2 years ago
- [ACL '26 Findings] V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in MLLMs☆27Apr 28, 2026Updated last month
- KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion☆12Oct 21, 2021Updated 4 years ago
- This is an OpenCV implementation of "Image smoothing via L0 Gradient Minimization"☆16Jul 12, 2014Updated 11 years ago
- Code for ECCV 2022 Workshop paper "See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval"☆22Nov 16, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A simple implementation of Structure-Aware Halftone (SIGGRAPH'08).☆21Aug 7, 2014Updated 11 years ago
- Code for 'Geospatial Entity Resolution' paper (WWW 2022)☆19Apr 27, 2023Updated 3 years ago
- ☆47Aug 15, 2023Updated 2 years ago
- ☆24Mar 16, 2025Updated last year
- ☆33Oct 27, 2025Updated 7 months ago
- ☆14Aug 7, 2023Updated 2 years ago
- 初步版IDEA使用指南☆23Jan 18, 2019Updated 7 years ago
- ☆13Oct 23, 2024Updated last year
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆21May 30, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This spider get infomation from ahu-jwc's website to get more convenience.☆12Dec 1, 2017Updated 8 years ago
- Linq's Function Calling Agent utilizing DeepSeek-R1☆30Feb 5, 2025Updated last year
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆28Apr 14, 2026Updated 2 months ago
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".☆116Feb 6, 2026Updated 4 months ago
- EmbedDB Embedded Database for IoT and Sensors Supporting Key-Value and Relational Data☆47May 26, 2026Updated 2 weeks ago
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆57Mar 12, 2025Updated last year
- TBD☆59Mar 13, 2026Updated 3 months ago