zifuwan / ONLYView external linksLinks
[ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
☆49Jul 7, 2025Updated 7 months ago
Alternatives and similar repositories for ONLY
Users that are interested in ONLY are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆24Apr 14, 2025Updated 10 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆171Sep 25, 2025Updated 4 months ago
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models☆38Jan 27, 2026Updated 2 weeks ago
- ☆71Jul 28, 2025Updated 6 months ago
- IPO: Interpretable Prompt Optimization for Vision-Language Models(NeurIPS 2024)☆15Mar 4, 2025Updated 11 months ago
- Code for Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models☆27Oct 29, 2024Updated last year
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention☆61Jul 16, 2024Updated last year
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆37Jan 8, 2025Updated last year
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆50Mar 13, 2025Updated 11 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆133Sep 11, 2025Updated 5 months ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆28Nov 24, 2025Updated 2 months ago
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…☆52Jan 22, 2026Updated 3 weeks ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆49Mar 24, 2025Updated 10 months ago
- Research works from Tencent AI Lab regarding self-evolving agents☆82Jan 30, 2026Updated 2 weeks ago
- [CVPR 2024] Code for HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation☆76Oct 11, 2024Updated last year
- Training recipe for SpatialReasoner☆37Sep 21, 2025Updated 4 months ago
- Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.☆36Jul 7, 2025Updated 7 months ago
- ☆23Aug 20, 2024Updated last year
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆92Feb 16, 2025Updated last year
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆66Apr 11, 2025Updated 10 months ago
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models☆26Feb 13, 2025Updated last year
- [NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding☆47Sep 21, 2025Updated 4 months ago
- ☆124Nov 1, 2025Updated 3 months ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆396Aug 24, 2024Updated last year
- ☆107Aug 14, 2025Updated 6 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆103Nov 23, 2024Updated last year
- Notes about courses Machine Learning 2025 Spring by Hung-yi Lee☆23Sep 22, 2025Updated 4 months ago
- [NeurIPS 2025] Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models☆63Nov 27, 2025Updated 2 months ago
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs☆163Nov 6, 2024Updated last year
- ☆44Jun 19, 2025Updated 7 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆46Sep 8, 2025Updated 5 months ago
- Visual Instruction Tuning for Qwen2 Base Model☆41Jun 29, 2024Updated last year
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆74Feb 9, 2026Updated last week
- [EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆61Feb 4, 2026Updated last week
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆44Apr 18, 2025Updated 9 months ago
- MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks (ICML 2023)☆11Aug 15, 2023Updated 2 years ago
- Instituto de Telecomunicações Deep Learning-based Point Cloud Codec☆11Jun 18, 2024Updated last year
- ☆13Jul 3, 2024Updated last year
- 用Kinect2.0读取图像的深度等信息,分割出手部图像。用HOG提取手部图像信息,接着用SVM进行训练。目的是为了识别手势。☆10Jan 8, 2020Updated 6 years ago