itsqyh / Awesome-LMMs-Mechanistic-Interpretability
☆18Updated last month
Alternatives and similar repositories for Awesome-LMMs-Mechanistic-Interpretability:
Users that are interested in Awesome-LMMs-Mechanistic-Interpretability are comparing it to the libraries listed below
- Latest Advances on Modality Priors in Multimodal Large Language Models☆10Updated 2 weeks ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆28Updated 2 weeks ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆56Updated 11 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal …☆46Updated last month
- A Survey on the Honesty of Large Language Models☆56Updated 3 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆69Updated 4 months ago
- M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆55Updated 3 months ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆27Updated 4 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆36Updated 4 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆33Updated 8 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆44Updated 4 months ago
- ☆67Updated 8 months ago
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆63Updated 3 weeks ago
- ☆23Updated 5 months ago
- A curated list of resources for activation engineering☆46Updated 2 weeks ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆65Updated this week
- ☆33Updated last month
- ☆64Updated 9 months ago
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆29Updated 4 months ago
- ☆22Updated this week
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆85Updated 3 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆65Updated last month
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 5 months ago
- The official code repository for PRMBench.☆68Updated last month
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆12Updated 2 weeks ago
- The reinforcement learning codes for dataset SPA-VL☆31Updated 9 months ago
- ☆13Updated 8 months ago
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)☆20Updated 5 months ago
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)☆23Updated 9 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆41Updated 3 months ago