Han-Zongbo/Skip-n

This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.

☆15

Alternatives and similar repositories for Skip-n

Users who are interested in Skip-n are comparing it to the repositories listed below.

skylineeeeen / DOTA
☆17 · Jul 2, 2026 · Updated 3 weeks ago
sangminwoo / RITUAL
Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…
☆14 · Dec 16, 2024 · Updated last year
d-ailin / CLIP-Guided-Decoding
☆18 · Aug 1, 2024 · Updated last year
orrzohar / LOVM
[NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection
☆21 · Feb 3, 2024 · Updated 2 years ago
zhangce01 / DeGF
[ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models
☆26 · Apr 14, 2025 · Updated last year
GQBBBB / UCI
☆10 · Oct 5, 2023 · Updated 2 years ago
bhrqw / SADA
CVPR2023: Few-Shot Learning with Visual Distribution Calibration and Cross-Modal Distribution Alignment
☆14 · May 19, 2023 · Updated 3 years ago
LaVi-Lab / Visual-Table
[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
☆20 · Oct 17, 2024 · Updated last year
BillChan226 / HALC
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
☆115 · Dec 4, 2024 · Updated last year
yejipark-m / ConVis
[AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode…
☆25 · Sep 26, 2024 · Updated last year
alceubissoto / debiasing-skin
☆19 · Jun 15, 2020 · Updated 6 years ago
QingyangZhang / TEMPO
Scaling Test-time Training for LLM Reasoning
☆27 · Apr 14, 2026 · Updated 3 months ago
opendatalab / CLIP-Parrot-Bias
ECCV2024_Parrot Captions Teach CLIP to Spot Text
☆66 · Sep 6, 2024 · Updated last year
YilongLv / AID
☆12 · May 30, 2024 · Updated 2 years ago
Shengcao-Cao / groundLMM
Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision
☆47 · Oct 19, 2025 · Updated 9 months ago
syp2ysy / prompt-SelF
[TIP] Exploring Effective Factors for Improving Visual In-Context Learning
☆21 · Jul 2, 2025 · Updated last year
DAMO-NLP-SG / VCD
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
☆410 · Oct 7, 2024 · Updated last year
sxzrt / CIFAR-10-W
CIFAR-10-Warehouse: Towards Broad and More Realistic Testbeds in Model Generalization Analysis
☆18 · Jul 15, 2024 · Updated 2 years ago
zlab-princeton / UEval
UEval: A Benchmark for Unified Multimodal Generation
☆24 · Apr 20, 2026 · Updated 3 months ago
boyazeng / understand_bias
Code release for "Understanding Bias in Large-Scale Visual Datasets"
☆25 · Dec 4, 2024 · Updated last year
RUCAIBox / POPE
The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models"
☆266 · Aug 21, 2025 · Updated 11 months ago
KejiaZhang-Robust / VAP
[NeurIPS 2025] Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
☆38 · Sep 21, 2025 · Updated 10 months ago
UCSB-AI / MMWorld
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
☆28 · Jul 15, 2025 · Updated last year
RLHF-V / RLHF-V
[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
☆310 · Sep 11, 2024 · Updated last year
CRIPAC-DIG / LogicCheckGPT
[ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…
☆25 · Jan 31, 2025 · Updated last year
X-PLUG / mPLUG-HalOwl
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
☆100 · Jan 29, 2024 · Updated 2 years ago
yfzhang114 / SliME
✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
☆163 · Dec 26, 2024 · Updated last year
bronyayang / HallE_Control
HallE-Control: Controlling Object Hallucination in LMMs
☆32 · Apr 10, 2024 · Updated 2 years ago
hasanar1f / PAINT
[CVPR 2025 Workshop] PAINT (Paying Attention to INformed Tokens) is a plug-and-play framework that intervenes in the self-attention of th…
☆20 · Updated this week
sangminwoo / AvisC
[ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…
☆25 · Jul 21, 2024 · Updated 2 years ago
Kwai-YuanQi / TaskGalaxy
Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types
☆32 · Jul 16, 2025 · Updated last year
aaronserianni / attention-iou
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13 · Mar 26, 2025 · Updated last year
wdchenxyz / CNN2
Code for "CNN^2: Viewpoint Generalization via a Binocular Vision" (NeurIPS 2019)
☆11 · Aug 7, 2021 · Updated 4 years ago
lisadunlap / LADS
Official Implementation of LADS (Latent Augmentation using Domain descriptionS)
☆50 · Apr 18, 2023 · Updated 3 years ago
mit-acl / gym-minigrid
☆16 · Jun 9, 2020 · Updated 6 years ago
kai-wen-yang / IDAA
[ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"
☆10 · Jul 24, 2022 · Updated 4 years ago
ParadoxZW / LLaVA-UHD-Better
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
☆35 · Aug 12, 2024 · Updated last year
snap-stanford / zeroc
ZeroC is a neuro-symbolic method that, trained with elementary visual concepts and relations, can zero-shot recognize and acquire more com…
☆33 · May 8, 2023 · Updated 3 years ago
Ziwei-Zheng / Nullu
Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection
☆63 · Mar 13, 2025 · Updated last year