Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text matching/retrieval models.
☆16Nov 20, 2025Updated 6 months ago
Alternatives and similar repositories for PC2-NoiseofWeb
Users that are interested in PC2-NoiseofWeb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆15Jun 6, 2024Updated 2 years ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆136May 23, 2026Updated 3 weeks ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated last year
- [ACM MM 2024] Pytorch Code for the paper "Robust Variational Contrastive Learning for Partially View-unaligned Clustering"☆16Feb 7, 2026Updated 4 months ago
- Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching (ACM SIGIR 2024, Pytorch Code)☆22Apr 16, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..☆20Dec 3, 2023Updated 2 years ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆33Jun 18, 2025Updated 11 months ago
- Pytorch implementation of "Test-time Adaptation for Cross-modal Retrieval with Query Shift".☆35Nov 22, 2025Updated 6 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆83May 24, 2026Updated 3 weeks ago
- ☆29Jun 4, 2023Updated 3 years ago
- ☆82Nov 6, 2023Updated 2 years ago
- Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)☆16Jan 7, 2025Updated last year
- ☆12Feb 2, 2024Updated 2 years ago
- "Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…☆13Nov 20, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆15Jun 2, 2024Updated 2 years ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆105Nov 20, 2025Updated 6 months ago
- ☆13Sep 26, 2024Updated last year
- Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation☆38Oct 18, 2024Updated last year
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering☆18Oct 31, 2024Updated last year
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis☆15May 16, 2024Updated 2 years ago
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆39Jan 25, 2024Updated 2 years ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆70Apr 5, 2026Updated 2 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official Implementation of Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval☆26Jul 14, 2025Updated 11 months ago
- ☆14Jul 13, 2024Updated last year
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Mar 21, 2024Updated 2 years ago
- ☆18Feb 20, 2024Updated 2 years ago
- Code for BYOP [CVPR 2023]☆12Sep 25, 2023Updated 2 years ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated 2 years ago
- Source code for TCSVT paper “Deep Semantic-Aware Proxy Hashing for Multi-Label Cross-Modal Retrieval”☆21Nov 30, 2025Updated 6 months ago
- This repo contains the code and data of "Graph Matching with Bi-level Noisy Correspondence".☆20Jul 28, 2023Updated 2 years ago
- Chinese Vision-Language Understanding Evaluation☆23Dec 26, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12May 3, 2024Updated 2 years ago
- Source code for the published paper "Self-Weighted Multiview Clustering with Multiple Graphs" IJCAI 2017.☆17Dec 4, 2017Updated 8 years ago
- [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling☆17Jun 6, 2024Updated 2 years ago
- [ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding☆14Oct 2, 2024Updated last year
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".☆28Sep 15, 2025Updated 9 months ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆18Mar 18, 2026Updated 2 months ago
- ☆17Apr 30, 2024Updated 2 years ago