Ziwei-Zheng / LVLM-Stethoscope
A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).
☆20 Updated 3 weeks ago
Alternatives and similar repositories for LVLM-Stethoscope:
Users who are interested in LVLM-Stethoscope are comparing it to the libraries listed below.
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection ☆13 Updated 3 weeks ago
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition ☆45 Updated 8 months ago
- [ICCV 2023 oral] This is the official repository for our paper: "Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning". ☆65 Updated last year
- Official implementation of Dynamic Perceiver ☆42 Updated last year
- Official repository of Uni-AdaFocus (TPAMI 2024). ☆31 Updated last month
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory" ☆66 Updated 3 months ago
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning" ☆90 Updated last month
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Pekin… ☆67 Updated 2 months ago
- [ICCV 23] An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech… ☆92 Updated last year
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation ☆31 Updated 3 months ago
- [IEEE TIP] Fine-grained Recognition with Learnable Semantic Data Augmentation ☆27 Updated last year
- [NeurIPS'22] This is an official implementation for "Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning". ☆176 Updated last year
- Instruction Tuning in Continual Learning paradigm ☆38 Updated last month
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks ☆24 Updated last year
- [TPAMI reviewing] Towards Visual Grounding: A Survey ☆42 Updated this week
- [arXiv] Cross-Modal Adapter for Text-Video Retrieval ☆55 Updated 2 years ago
- Code for "DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets", accepted at NeurIPS 2023 (Main confer… ☆21 Updated 9 months ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass ☆175 Updated last year
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions ☆121 Updated last month
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching ☆95 Updated 7 months ago
- [NeurIPS 2023] Generalized Logit Adjustment ☆33 Updated 8 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023) ☆68 Updated last year
- [NeurIPS 2023] Rank-DETR for High Quality Object Detection ☆88 Updated last year
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers. ☆30 Updated 2 weeks ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023 ☆79 Updated 11 months ago
- Official PyTorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023) ☆70 Updated 11 months ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129) ☆91 Updated last year