Official Implementation of Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval
☆25Jul 14, 2025Updated 7 months ago
Alternatives and similar repositories for 2025-ICML-VISA
Users that are interested in 2025-ICML-VISA are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆96Nov 20, 2025Updated 3 months ago
- ☆12Feb 2, 2024Updated 2 years ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆120Nov 26, 2025Updated 3 months ago
- Official implementation of "Decoupled Contrastive Multi-View Clustering with High-Order Random Walks", [AAAI 2024].☆23Feb 6, 2024Updated 2 years ago
- Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]☆119Apr 18, 2024Updated last year
- ☆30Feb 2, 2026Updated last month
- Document Scanner with OCR iOS app written in Swift☆10Nov 22, 2021Updated 4 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆42May 3, 2025Updated 10 months ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆47Dec 24, 2024Updated last year
- ☆11May 7, 2020Updated 5 years ago
- OF addon for Artoolkit5 (Marker and NFT)☆12Apr 20, 2020Updated 5 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2…☆32Jan 30, 2026Updated last month
- ☆18Mar 2, 2025Updated last year
- cmake scripts for cross compilg pcl and its dependencies on Android and iOS☆10Nov 23, 2018Updated 7 years ago
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"☆18Oct 7, 2025Updated 4 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago