Ruiyang-061X/Awesome-MLLM-Uncertainty

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ruiyang-061X/Awesome-MLLM-Uncertainty)

Ruiyang-061X / Awesome-MLLM-Uncertainty

✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).

☆59

Alternatives and similar repositories for Awesome-MLLM-Uncertainty

Users that are interested in Awesome-MLLM-Uncertainty are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Oli-iver / Depth-BID
View on GitHub
Offical repo for ECCV 2024: Depth-Aware Blind Image Decomposition for Real-World Weather Recovery
☆13Mar 7, 2024Updated 2 years ago
Ruiyang-061X / LiSe
View on GitHub
[ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.
☆40Sep 3, 2024Updated last year
Zeus1037 / SEED
View on GitHub
SEED Dataset
☆29Jun 3, 2025Updated last year
Ruiyang-061X / Uncertainty-o
View on GitHub
✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…
☆21Mar 13, 2025Updated last year
Ruiyang-061X / VL-Uncertainty
View on GitHub
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
☆56Mar 18, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
MultimodalGeo / GeoText-1652
View on GitHub
An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
☆118Jul 7, 2026Updated 3 weeks ago
Ruiyang-061X / UA3D
View on GitHub
[ICCV'25] "Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection".
☆26Jan 12, 2026Updated 6 months ago
Jahawn-Wen / CAMeL-reID
View on GitHub
[IEEE Transactions on Information Forensics and Security'25] Pytorch implementation of CAMeL: Cross-modality Adaptive Meta-Learning for T…
☆17Jan 5, 2026Updated 6 months ago
lingyuliu / DQ_Transformer
View on GitHub
Official Repo for Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting
☆17Mar 31, 2026Updated 3 months ago
refkxh / C-Instructor
View on GitHub
[ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting
☆31Dec 16, 2024Updated last year
minjoong507 / BM-DETR
View on GitHub
[WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"
☆16Feb 24, 2025Updated last year
YcZhangSing / AMD
View on GitHub
[CVPR 2026] Code for "The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts"
☆20Jun 13, 2026Updated last month
roi-hpi / IDK-token-tuning
View on GitHub
☆16Jul 17, 2025Updated last year
ffmpbgrnn / improved_traj_cuda
View on GitHub
GPU implementation of improved dense trajectory
☆10Apr 14, 2015Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
YilinZhang107 / Multi-view-Datasets
View on GitHub
This repository contains some of the multi-view datasets that are often used in our research.
☆20Jan 1, 2025Updated last year
ml-stat-Sustech / CLIP_Calibration
View on GitHub
[ICML'24] Open-Vocabulary Calibration for Fine-tuned CLIP
☆18Jun 14, 2024Updated 2 years ago
kerner-lab / Mars-Bench
View on GitHub
Mars-Bench is a standardized benchmark for evaluating vision models on Martian surface and orbital imagery, covering 20 datasets across c…
☆18Dec 5, 2025Updated 7 months ago
MiaoXiong2320 / ProximityBias-Calibration
View on GitHub
☆19Nov 11, 2023Updated 2 years ago
xuyang-liu16 / hermes-code-bridge
View on GitHub
Use Hermes Agent as the control plane for local coding agents like Codex, Kimi Code, Claude Code, OpenCode, and Gemini CLI.
☆25Jul 22, 2026Updated last week
knightyxp / DGL
View on GitHub
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
☆49Oct 14, 2024Updated last year
Monoxide-Chen / uncertainty_retrieval
View on GitHub
ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
☆74Jan 30, 2024Updated 2 years ago
reds-lab / CLIP-MIA
View on GitHub
This is an official repository for Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study (ICCV2023…
☆26Sep 29, 2023Updated 2 years ago
Line-Kite / GraphLayoutLM
View on GitHub
☆14Sep 6, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AtsuMiyai / GL-MCM
View on GitHub
[IJCV2025] https://arxiv.org/abs/2304.04521
☆16Jan 22, 2025Updated last year
HuiGuanLab / RaTSG
View on GitHub
This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13Aug 22, 2025Updated 11 months ago
jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness
View on GitHub
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
☆832Jun 5, 2026Updated last month
Shuyu-XJTU / APTM
View on GitHub
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
☆174Jul 6, 2026Updated 3 weeks ago
chenxn2020 / GOSE
View on GitHub
[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"
☆17Dec 1, 2023Updated 2 years ago
BIT-DA / GDCAN
View on GitHub
[TPAMI 2021] Code release for "Generalized Domain Conditioned Adaptation Network" https://arxiv.org/abs/2103.12339
☆45Jun 22, 2022Updated 4 years ago
zhenyuw16 / combatnoise
View on GitHub
code for "Combating Noise: Semi-supervised Learning by Region Uncertainty Quantification"
☆10Mar 19, 2022Updated 4 years ago
liupei101 / MIREL
View on GitHub
Weakly-Supervised Residual Evidential Learning for Multi-Instance Uncertainty Estimation (ICML 2024)
☆15Jul 19, 2024Updated 2 years ago
allenai / persona-bias
View on GitHub
☆29May 6, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zhaozhengChen / LPCAM
View on GitHub
The official code of CVPR 2023 paper (Extracting Class Activation Maps from Non-Discriminative Features as well).
☆31Mar 22, 2023Updated 3 years ago
thomassutter / mmvmvae
View on GitHub
☆14Oct 30, 2024Updated last year
Mr-Neko / JM3D
View on GitHub
The offical implemention of JM3D.
☆31Apr 8, 2026Updated 3 months ago
AtsuMiyai / UPD
View on GitHub
[ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models
☆82Mar 6, 2026Updated 4 months ago
ErgastiAlex / MARS
View on GitHub
☆37Mar 28, 2025Updated last year
NEU-REAL / 4D-CS
View on GitHub
☆16Jun 20, 2024Updated 2 years ago
lntzm / MESM
View on GitHub
The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)
☆32Mar 29, 2024Updated 2 years ago