✨A curated list of papers on uncertainty in multi-modal large language models (MLLMs).
☆59Apr 2, 2025Updated 11 months ago
Alternatives and similar repositories for Awesome-MLLM-Uncertainty
Users who are interested in Awesome-MLLM-Uncertainty are comparing it to the repositories listed below.
- Official repo for ECCV 2024: Depth-Aware Blind Image Decomposition for Real-World Weather Recovery☆13Mar 7, 2024Updated 2 years ago
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆48Mar 18, 2025Updated 11 months ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆18Mar 13, 2025Updated 11 months ago
- [ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.☆39Sep 3, 2024Updated last year
- SEED Dataset☆28Jun 3, 2025Updated 9 months ago
- Progressive Text-to-3D Generation for Automatic 3D Prototyping (ACM TOMM)☆50Updated this week
- [ICCV'25] "Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection".☆25Jan 12, 2026Updated last month
- An official repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆110Jan 26, 2025Updated last year
- This is the official implementation of "Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation" (Accepted at AC…☆14Aug 24, 2024Updated last year
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning☆15Dec 12, 2023Updated 2 years ago
- [WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"☆16Feb 24, 2025Updated last year
- This repository contains some of the multi-view datasets that are often used in our research.☆17Jan 1, 2025Updated last year
- GPU implementation of improved dense trajectory☆10Apr 14, 2015Updated 10 years ago
- 🎨Official Repo for Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation☆55Apr 10, 2025Updated 11 months ago
- 📖Curated list about reasoning ability of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.☆13Feb 7, 2025Updated last year
- Interpretable Control Exploration and Counterfactual Explanation (ICE) on StyleGAN☆17Jan 5, 2022Updated 4 years ago
- Official PyTorch codebase for the Modeling Caption Diversity in Contrastive Vision-Language Pretraining paper.☆18Mar 28, 2025Updated 11 months ago
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Jan 22, 2025Updated last year
- ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆74Jan 30, 2024Updated 2 years ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆47Oct 14, 2024Updated last year
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆24Jul 21, 2024Updated last year
- [ICLR 2020] Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma, "I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifie…☆20Dec 30, 2021Updated 4 years ago
- ☆19Nov 11, 2023Updated 2 years ago
- This is an official repository for Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study (ICCV2023…☆24Sep 29, 2023Updated 2 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- ☆27May 6, 2024Updated last year
- [ICASSP 2023] Official Implementation of ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis☆29Jun 10, 2023Updated 2 years ago
- PIE: Simulating Disease Progression via Progressive Image Editing☆30Oct 19, 2023Updated 2 years ago
- The official implementation of JM3D.☆31Aug 18, 2025Updated 6 months ago
- This repo contains code and models for the implementation of ViT-DD, a semi-supervised method for detecting driver distractions.☆33Apr 4, 2023Updated 2 years ago
- The official code of CVPR 2023 paper (Extracting Class Activation Maps from Non-Discriminative Features as well).☆31Mar 22, 2023Updated 2 years ago
- 📄 A curated list of visual reasoning papers.☆31Nov 1, 2025Updated 4 months ago
- Official repo for "SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-Supervised Learning", accepted by ICLR 2023.☆22Jan 31, 2023Updated 3 years ago
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning☆45Jul 2, 2025Updated 8 months ago
- ☆27Sep 3, 2024Updated last year
- Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models☆812May 21, 2025Updated 9 months ago
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)☆32Mar 29, 2024Updated last year
- 📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).☆986Sep 27, 2025Updated 5 months ago