[WWW 2025 Oral] ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning
☆20Jul 2, 2025Updated 8 months ago
Alternatives and similar repositories for ImageScope
Users that are interested in ImageScope are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jun 11, 2025Updated 9 months ago
- [ICCV 2025] Official PyTorch Code for "Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval"☆17Aug 23, 2025Updated 7 months ago
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- ☆15Sep 30, 2024Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆86Aug 6, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆13Feb 25, 2025Updated last year
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆15Jun 6, 2024Updated last year
- Computer-Use Agents as Judges for Generative UI☆44Nov 27, 2025Updated 3 months ago
- We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-…☆24Jan 27, 2026Updated last month
- export any your YOLOv7 model to TensorFlow, TensorFlowJs, ONNX, OpenVINO, RKNN,...☆14Feb 7, 2025Updated last year
- This repository is an implementation of object detection to detect waste in real-time directly on the browser using the TFJS-TFLite Web A…☆16Feb 6, 2024Updated 2 years ago
- vue+elementUI 创建的一个好看的UI页面。暂时无js代码,只作为UI展示。☆11Feb 4, 2023Updated 3 years ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆17Apr 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Final Solution for MAG240M dataset of OGB-LSC@KDDCUP2021☆11Jun 17, 2021Updated 4 years ago
- A modular and stable agent sandbox runtime environment.☆46Jan 8, 2026Updated 2 months ago
- ☆12Jun 23, 2023Updated 2 years ago
- Cross-Modal Retrieval with Partially Mismatched Pairs (IEEE TPAMI 2023, PyTorch Code)☆23Sep 17, 2023Updated 2 years ago
- V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in MLLMs☆24Jul 31, 2025Updated 7 months ago
- KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion☆12Oct 21, 2021Updated 4 years ago
- This is an OpenCV implementation of "Image smoothing via L0 Gradient Minimization"☆16Jul 12, 2014Updated 11 years ago
- Code for ECCV 2022 Workshop paper "See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval"☆21Nov 16, 2025Updated 4 months ago
- A simple implementation of Structure-Aware Halftone (SIGGRAPH'08).☆20Aug 7, 2014Updated 11 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".☆95Feb 6, 2026Updated last month
- Code for 'Geospatial Entity Resolution' paper (WWW 2022)☆19Apr 27, 2023Updated 2 years ago
- ☆43Aug 15, 2023Updated 2 years ago
- 初步版IDEA使用指南☆23Jan 18, 2019Updated 7 years ago
- ☆24Mar 16, 2025Updated last year
- ☆33Oct 27, 2025Updated 4 months ago
- ☆14Aug 7, 2023Updated 2 years ago
- ☆13Oct 23, 2024Updated last year
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆21May 30, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- This spider get infomation from ahu-jwc's website to get more convenience.☆12Dec 1, 2017Updated 8 years ago
- Linq's Function Calling Agent utilizing DeepSeek-R1☆29Feb 5, 2025Updated last year
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆28Feb 5, 2025Updated last year
- TBD☆50Mar 13, 2026Updated last week
- EmbedDB Embedded Database for IoT and Sensors Supporting Key-Value and Relational Data☆45Updated this week
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆54Mar 12, 2025Updated last year
- A Web-based MP4 File Inspector. Powered by Rust, Vue and Web Assembly!☆36Mar 18, 2026Updated last week