☆37 · Oct 11, 2022 · Updated 3 years ago
Alternatives and similar repositories for MAP
Users interested in MAP compare it to the repositories listed below.
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies" ☆18 · Mar 21, 2023 · Updated 2 years ago
- Curated list about the reasoning ability of MLLMs, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking. ☆13 · Feb 7, 2025 · Updated last year
- ECCV2024: Adversarial Prompt Tuning for Vision-Language Models ☆31 · Nov 19, 2024 · Updated last year
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model… ☆18 · Mar 13, 2025 · Updated 11 months ago
- (no description) ☆19 · Dec 13, 2023 · Updated 2 years ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em… ☆77 · Nov 7, 2025 · Updated 3 months ago
- (ECCV 2022) BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks ☆50 · Dec 14, 2022 · Updated 3 years ago
- Official Pytorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021) ☆138 · Mar 1, 2024 · Updated 2 years ago
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum… ☆27 · Jul 22, 2024 · Updated last year
- (no description) ☆28 · Aug 14, 2024 · Updated last year
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024) ☆60 · May 26, 2024 · Updated last year
- This is the official repo for Towards Uncertainty-Aware Language Agent. ☆31 · Aug 15, 2024 · Updated last year
- [NeurIPS 2023] The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" acce… ☆27 · May 14, 2024 · Updated last year
- M-HalDetect Dataset Release ☆27 · Nov 4, 2023 · Updated 2 years ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024 ☆33 · Jun 18, 2025 · Updated 8 months ago
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023) ☆29 · Dec 27, 2023 · Updated 2 years ago
- (no description) ☆35 · Nov 20, 2023 · Updated 2 years ago
- (no description) ☆43 · Mar 31, 2025 · Updated 11 months ago
- Official PyTorch Implementation for the "RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling" paper! ☆13 · Jun 10, 2025 · Updated 8 months ago
- (no description) ☆81 · Nov 6, 2023 · Updated 2 years ago
- [NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale perso… ☆73 · Oct 20, 2025 · Updated 4 months ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning" ☆11 · Mar 30, 2020 · Updated 5 years ago
- Vision Transformer (ViT) models, with their attention mechanisms, revolutionized computer vision. By merging Class Activation Map (CAM) a… ☆13 · Aug 14, 2023 · Updated 2 years ago
- (no description) ☆11 · Jul 1, 2022 · Updated 3 years ago
- Introduction to Machine Learning using scikit-learn and PyTorch ☆10 · Sep 26, 2019 · Updated 6 years ago
- (no description) ☆11 · Jun 18, 2023 · Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization" ☆11 · May 16, 2023 · Updated 2 years ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em… ☆120 · Nov 26, 2025 · Updated 3 months ago
- (no description) ☆37 · Apr 13, 2023 · Updated 2 years ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning ☆170 · Sep 26, 2022 · Updated 3 years ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs". ☆13 · Jan 25, 2025 · Updated last year
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation ☆23 · Dec 30, 2025 · Updated 2 months ago
- (no description) ☆10 · Mar 6, 2022 · Updated 3 years ago
- Knowledge-Guided Adaptation of Pathology Foundation Models Improves Cross-domain Generalization and Demographic Fairness ☆17 · Oct 14, 2025 · Updated 4 months ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks ☆12 · Sep 1, 2023 · Updated 2 years ago
- (no description) ☆13 · Feb 14, 2022 · Updated 4 years ago
- [ICCV 2021] Multimodal Knowledge Expansion ☆10 · Aug 28, 2021 · Updated 4 years ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting" ☆38 · Oct 9, 2025 · Updated 4 months ago
- UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs ☆11 · Apr 13, 2023 · Updated 2 years ago