Ruiyang-061X / Uncertainty-oView external linksLinks
✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models".
☆18Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for Uncertainty-o
Users that are interested in Uncertainty-o are comparing it to the libraries listed below
Sorting:
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆59Apr 2, 2025Updated 10 months ago
- Offical repo for ECCV 2024: Depth-Aware Blind Image Decomposition for Real-World Weather Recovery☆13Mar 7, 2024Updated last year
- [ICCV'25] "Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection".☆25Jan 12, 2026Updated last month
- This is the official implementation of "Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation" (Accepted at AC…☆14Aug 24, 2024Updated last year
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning☆15Dec 12, 2023Updated 2 years ago
- ☆16Jan 13, 2024Updated 2 years ago
- Codes, data, and baselines for CIKM 2023 Long Paper "Dual Intents Graph Modeling for User-centric Group Discovery"☆17Oct 22, 2023Updated 2 years ago
- The official repo of "Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark"☆21Jun 5, 2025Updated 8 months ago
- [ICML 2024] Offical code repo for ICML2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with …☆32Jun 21, 2024Updated last year
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Sep 13, 2022Updated 3 years ago
- a training-free approach to accelerate ViTs and VLMs by pruning redundant tokens based on similarity☆42May 24, 2025Updated 8 months ago
- A collection for basic machine learning and data mining model implementations, in Python, mainly referencing the books: *Machine Learning…☆13Jul 15, 2021Updated 4 years ago
- ☆10Dec 16, 2023Updated 2 years ago
- GraphSleepNet: Adaptive Spatial-Temporal Graph Convolutional Networks for Sleep Stage Classification☆12Jul 24, 2020Updated 5 years ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated last year
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆52Jul 11, 2025Updated 7 months ago
- 基于numpy的smpl算法☆12Aug 25, 2021Updated 4 years ago
- [ICML 2024] Official code for Uncertainty Estimation by Density Aware Evidential Deep Learning☆14Jul 14, 2024Updated last year
- 国家税务总局全国增值税发票查验平台(https://inv-veri.chinatax.gov.cn/) 测试查询☆11Jan 3, 2023Updated 3 years ago
- IPO: Interpretable Prompt Optimization for Vision-Language Models(NeurIPS 2024)☆15Mar 4, 2025Updated 11 months ago
- OW-OVD: Unified Open World and Open Vocabulary Object Detection (CVPR 2025)☆23Dec 2, 2024Updated last year
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆296Mar 13, 2024Updated last year
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆17Jun 19, 2025Updated 7 months ago
- ☆10Mar 19, 2024Updated last year
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- ☆28Dec 4, 2025Updated 2 months ago
- A self-adaptive and class-balanced approach to improve deep neural network performance in the presence of noisy labels☆19Jul 2, 2024Updated last year
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆24Jun 17, 2025Updated 8 months ago
- Code for our Source-free Unsupervised Video Domain Adaptation Paper☆13Jan 17, 2025Updated last year
- Official implementation of CVPR2025 paper "Towards Universal AI-Generated Image Detection by Variational Information Bottleneck Network"☆19Oct 31, 2025Updated 3 months ago
- [NeurIPS 2025] Implementation for paper "Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text"☆29Jun 10, 2025Updated 8 months ago
- [ACM MM-2021] WePerson: learning a generalized re-identification model from all-weather virtual data☆11May 23, 2023Updated 2 years ago
- 地图足迹故事,微信小程序☆10May 5, 2022Updated 3 years ago
- For early fire detection, smoke must be detect first. This project create smoke videos to feed deep laerning dataset☆11Apr 25, 2018Updated 7 years ago
- Continual Evidential Deep Learning ICCVW 2023☆14Nov 3, 2023Updated 2 years ago
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated last year
- [NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"☆93Oct 21, 2025Updated 3 months ago
- ☆16Jun 5, 2024Updated last year
- Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…☆15May 5, 2024Updated last year