✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models".
☆18Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for Uncertainty-o
Users that are interested in Uncertainty-o are comparing it to the libraries listed below
Sorting:
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆48Mar 18, 2025Updated 11 months ago
- 📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- This is the official implementation of "Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation" (Accepted at AC…☆14Aug 24, 2024Updated last year
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning☆15Dec 12, 2023Updated 2 years ago
- ☆16Jan 13, 2024Updated 2 years ago
- Codes, data, and baselines for CIKM 2023 Long Paper "Dual Intents Graph Modeling for User-centric Group Discovery"☆17Oct 22, 2023Updated 2 years ago
- The official repo of "Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark"☆21Jun 5, 2025Updated 9 months ago
- [ICLR 2020] Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma, "I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifie…☆20Dec 30, 2021Updated 4 years ago
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆25Dec 19, 2024Updated last year
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Sep 13, 2022Updated 3 years ago
- a training-free approach to accelerate ViTs and VLMs by pruning redundant tokens based on similarity☆43May 24, 2025Updated 9 months ago
- ☆37Oct 11, 2022Updated 3 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- [ICLR 2025] Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate☆18Apr 22, 2025Updated 10 months ago
- GraphSleepNet: Adaptive Spatial-Temporal Graph Convolutional Networks for Sleep Stage Classification☆12Jul 24, 2020Updated 5 years ago
- Tracked Vehicle Retrieval by NL Challenge in the 2023 AI City Challenge.☆36Jan 19, 2023Updated 3 years ago
- 复旦研究生抢课脚本☆10Feb 14, 2022Updated 4 years ago
- A collection for basic machine learning and data mining model implementations, in Python, mainly referencing the books: *Machine Learning…☆13Jul 15, 2021Updated 4 years ago
- EARAM for fake news detection☆13Dec 30, 2025Updated 2 months ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆52Jul 11, 2025Updated 7 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- 基于numpy的smpl算法☆12Aug 25, 2021Updated 4 years ago
- Code for AAAI2024 paper: Towards Evidential and Class Separable Open Set Object Detection☆12Dec 23, 2023Updated 2 years ago
- [ACL 2023] Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation.☆10Dec 19, 2024Updated last year
- Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training☆11Jan 23, 2024Updated 2 years ago
- Source code of "Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection"☆13Nov 17, 2023Updated 2 years ago
- IPO: Interpretable Prompt Optimization for Vision-Language Models(NeurIPS 2024)☆15Mar 4, 2025Updated last year
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆110Jan 26, 2025Updated last year
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆296Mar 13, 2024Updated last year
- [ACMMM UAVM 2025] 🌍🚗 VICI: VLM-Instructed Cross-view Image-localisation 📡🗺️☆17Feb 4, 2026Updated last month
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- A self-adaptive and class-balanced approach to improve deep neural network performance in the presence of noisy labels☆19Jul 2, 2024Updated last year
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆18Jun 19, 2025Updated 8 months ago
- [CVPR 2025] Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation☆26Dec 9, 2025Updated 3 months ago
- [SIGKDD 2024] Rethinking Fair Graph Neural Networks from Re-balancing☆10Jul 15, 2024Updated last year
- Continual Evidential Deep Learning ICCVW 2023☆14Nov 3, 2023Updated 2 years ago
- ☆29Dec 4, 2025Updated 3 months ago
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated last year
- ☆19Mar 31, 2025Updated 11 months ago