CLCS-SUSTech / FACELinks
☆14Updated last year
Alternatives and similar repositories for FACE
Users that are interested in FACE are comparing it to the libraries listed below
Sorting:
- TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models☆19Updated last year
- AdaMoLE: Adaptive Mixture of LoRA Experts☆38Updated last year
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆58Updated last year
- This is the official implementation of our ICML 2024 paper "MultiMax: Sparse and Multi-Modal Attention Learning""☆21Updated 5 months ago
- Code base for "Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood"☆14Updated 5 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆76Updated 11 months ago
- Training A Small Emotional Vision Language Model for Visual Art Comprehension☆14Updated last year
- Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language models☆77Updated 6 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆45Updated 4 months ago
- [ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning".☆63Updated last week
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23☆36Updated 2 years ago
- MokA: Multimodal Low-Rank Adaptation for MLLMs☆80Updated last month
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆186Updated 4 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆85Updated last year
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆60Updated 7 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆103Updated last year
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…☆62Updated 3 months ago
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆46Updated 2 years ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆43Updated last year
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆32Updated last year
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆23Updated last year
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆32Updated 6 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆74Updated 8 months ago
- Official implementation of paper VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interact…☆43Updated last year
- HallE-Control: Controlling Object Hallucination in LMMs☆31Updated last year
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs☆163Updated last year
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆96Updated 2 months ago
- [NeurIPS'24 spotlight] MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning. [TPAMI'25] MECD+☆45Updated 3 months ago
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Updated last year
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆18Updated 8 months ago