Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text matching/retrieval models.
☆14Nov 20, 2025Updated 3 months ago
Alternatives and similar repositories for PC2-NoiseofWeb
Users that are interested in PC2-NoiseofWeb are comparing it to the libraries listed below
Sorting:
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆121Nov 26, 2025Updated 3 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- ☆11Nov 28, 2022Updated 3 years ago
- [ACM MM 2024] Pytorch Code for the paper "Robust Variational Contrastive Learning for Partially View-unaligned Clustering"☆14Feb 7, 2026Updated 3 weeks ago
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆15Jun 6, 2024Updated last year
- Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..☆20Dec 3, 2023Updated 2 years ago
- Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching (ACM SIGIR 2024, Pytorch Code)☆23Feb 18, 2025Updated last year
- ☆27Jun 4, 2023Updated 2 years ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆33Jun 18, 2025Updated 8 months ago
- Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation☆38Oct 18, 2024Updated last year
- ☆81Nov 6, 2023Updated 2 years ago
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆38Jan 25, 2024Updated 2 years ago
- ☆11Dec 6, 2024Updated last year
- Directed masked autoencoders☆14Feb 20, 2026Updated last week
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- A collection python tools used to create gguf files and upload to huggingface☆17Updated this week
- Automatic stabilizing and auto-piloting system for RC flying wing☆14Mar 3, 2016Updated 10 years ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆96Nov 20, 2025Updated 3 months ago
- Code and data for EMNLP2019 Paper "Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework…☆10May 24, 2020Updated 5 years ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated last year
- Python client to integrate Cleanlab Codex with your AI Agent☆19Nov 19, 2025Updated 3 months ago
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- ☆13Sep 26, 2024Updated last year
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis☆15May 16, 2024Updated last year
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance☆10Aug 17, 2024Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- 基于大语言模型的自动综述生成\nAutomatic Review Generation Method based on Large Language Models☆18Jun 22, 2025Updated 8 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated 3 weeks ago
- Finetuning Stable Diffusion from Diffusers☆12Mar 11, 2024Updated last year
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆51Sep 14, 2024Updated last year
- "Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024…☆13Nov 20, 2025Updated 3 months ago
- Bert Abstractive Summarization of Online News Discussion Threads☆13Dec 8, 2022Updated 3 years ago
- Gallery for Industry AI demos☆18May 1, 2023Updated 2 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 8 months ago
- 🍽meican Robot for reminding to order dinner and data analysis.☆35Jan 2, 2019Updated 7 years ago
- LLaVA-Next for STVG☆18Dec 5, 2025Updated 3 months ago
- ☆13Jul 13, 2024Updated last year
- A Python tool for fetching citations from multiple sources.☆14Apr 30, 2025Updated 10 months ago