Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text matching/retrieval models.
☆16Nov 20, 2025Updated 6 months ago
Alternatives and similar repositories for PC2-NoiseofWeb
Users that are interested in PC2-NoiseofWeb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆15Jun 6, 2024Updated last year
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆133May 12, 2026Updated 2 weeks ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 11 months ago
- [ACM MM 2024] Pytorch Code for the paper "Robust Variational Contrastive Learning for Partially View-unaligned Clustering"☆15Feb 7, 2026Updated 3 months ago
- ☆11Nov 28, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..☆20Dec 3, 2023Updated 2 years ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆33Jun 18, 2025Updated 11 months ago
- Pytorch implementation of "Test-time Adaptation for Cross-modal Retrieval with Query Shift".☆34Nov 22, 2025Updated 6 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆83May 12, 2026Updated 2 weeks ago
- ☆28Jun 4, 2023Updated 2 years ago
- ☆82Nov 6, 2023Updated 2 years ago
- Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)☆16Jan 7, 2025Updated last year
- ☆12Feb 2, 2024Updated 2 years ago
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆15Jun 2, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆103Nov 20, 2025Updated 6 months ago
- ☆13Sep 26, 2024Updated last year
- This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.…☆10Jul 19, 2017Updated 8 years ago
- ☆10Apr 10, 2019Updated 7 years ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering☆18Oct 31, 2024Updated last year
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆39Jan 25, 2024Updated 2 years ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆70Apr 5, 2026Updated last month
- AdaTriplet loss & automargin method☆19Mar 8, 2022Updated 4 years ago
- ☆13Jul 13, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Mar 21, 2024Updated 2 years ago
- ☆18Feb 20, 2024Updated 2 years ago
- Code for BYOP [CVPR 2023]☆11Sep 25, 2023Updated 2 years ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated 2 years ago
- Source code for TCSVT paper “Deep Semantic-Aware Proxy Hashing for Multi-Label Cross-Modal Retrieval”☆20Nov 30, 2025Updated 5 months ago
- deep fashion CNN in Keras☆10Feb 24, 2017Updated 9 years ago
- Chinese Vision-Language Understanding Evaluation☆23Dec 26, 2024Updated last year
- ☆12May 3, 2024Updated 2 years ago
- Source code for the published paper "Self-Weighted Multiview Clustering with Multiple Graphs" IJCAI 2017.☆17Dec 4, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding☆14Oct 2, 2024Updated last year
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".☆28Sep 15, 2025Updated 8 months ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆18Mar 18, 2026Updated 2 months ago
- ☆12Mar 28, 2024Updated 2 years ago
- [ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval☆26Feb 13, 2025Updated last year
- ☆16Apr 11, 2026Updated last month
- This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models☆19Sep 11, 2024Updated last year