Official implementation for the paper"Towards Understanding How Knowledge Evolves in Large Vision-Language Models"
☆33Apr 10, 2025Updated 11 months ago
Alternatives and similar repositories for Vlm-interpretability
Users that are interested in Vlm-interpretability are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance☆23Jul 25, 2024Updated last year
- Recursive Visual Programming (ECCV 2024)☆18Nov 20, 2024Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…☆14Dec 16, 2024Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Jul 29, 2024Updated last year
- ☆12Nov 6, 2024Updated last year
- ☆13Apr 5, 2023Updated 2 years ago
- Spatial Aptitude Training for Multimodal Langauge Models☆25Feb 8, 2026Updated last month
- Code for our ICCV 2023 paper "Parametric Information Maximization for Generalized Category Discovery"☆16Jun 17, 2024Updated last year
- ☆13Sep 14, 2022Updated 3 years ago
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year
- (ECCV2024) Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery (TextGCD)☆22Nov 26, 2025Updated 3 months ago
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆21Jun 24, 2025Updated 9 months ago
- code and resources for our paper "Achieving Joint Training Accuracy in Continual Learning" in AAAI2025☆14Feb 25, 2025Updated last year
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication☆18Mar 11, 2025Updated last year
- Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"☆92Feb 13, 2026Updated last month
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆27Aug 7, 2025Updated 7 months ago
- Starter notebook and utilities for the Clevr-4 dataset☆16Nov 1, 2023Updated 2 years ago
- ☆14May 15, 2025Updated 10 months ago
- ☆18Feb 19, 2024Updated 2 years ago
- [CVPR'24] Solving the Catastrophic Forgetting Problem in Generalized Category Discovery https://arxiv.org/pdf/2501.05272☆16Dec 24, 2024Updated last year
- rsbuild svg loader☆13Nov 11, 2024Updated last year
- 百度地图坐标拾取工具☆12Jan 27, 2018Updated 8 years ago
- ☆15Oct 27, 2023Updated 2 years ago
- Towards Training-free Open-world Segmentation via Image Prompt Foundation Models,☆18Nov 22, 2024Updated last year
- ☆36Feb 28, 2026Updated 3 weeks ago
- ☆16Mar 20, 2025Updated last year
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆12Apr 29, 2024Updated last year
- This repo has scripts to compare various powerful RL methods☆39Feb 23, 2026Updated last month
- Implementation of 'Attention-guided Feature Fusion for Small Object Detection'☆14Dec 21, 2023Updated 2 years ago
- kafka + structured streaming + phoenix + elasticsearch 基于行为日志实现热门推荐,用户偏好推荐,召回融合策略实现。☆19Sep 5, 2023Updated 2 years ago
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆17Nov 14, 2024Updated last year
- Learnable drift compensation (LDC) reduces semantic drift in continual learning using a trainable projector to map between tasks.☆19Nov 13, 2024Updated last year
- [CVPR'25] Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception☆16Oct 11, 2025Updated 5 months ago
- Official implementation for the WACV 2026 paper "Deepfake Detection that Generalizes Across Benchmarks".☆36Jan 12, 2026Updated 2 months ago
- ☆22Oct 25, 2024Updated last year
- A binary-only coverage-guided fuzzer based on AFL and e9patch☆22Oct 13, 2020Updated 5 years ago
- A self-made NeurIPS poster template, infused with the unique design style of ShanghaiTech.☆15Dec 26, 2023Updated 2 years ago