A lightweight and extensible toolkit for visualizing attention flow in Large Vision-Language Models (LVLMs). It renders token-to-token attention maps, cross-modal attention paths, and layer–head attention dynamics, helping researchers diagnose abnormal attention behaviors.
☆87Apr 14, 2026Updated this week
Alternatives and similar repositories for AttentionLens-LVLM
Users who are interested in AttentionLens-LVLM are comparing it to the libraries listed below.
- TSFM-MRE: Minimal Reproducible Experiment for Time-Series Foundation Models in Finance☆18Mar 7, 2026Updated last month
- (CVPR Workshop Best Paper Award) Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustn…☆18Nov 4, 2025Updated 5 months ago
- An Asian Large-Scale Challenging Dataset for DeepFake Detection☆13Jan 23, 2026Updated 2 months ago
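- A powerful API integration tool for the Feishu Open Platform, fully integrated with the FastGPT AI platform. Supports automatic synchronization of Feishu knowledge bases in all formats and integration with Feishu bots (with full support for thinking mode, streaming output, citation downloads, and image rendering)☆84Dec 21, 2025Updated 3 months ago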
- [ICLR2026] Omni-IML: Towards Unified Interpretable Image Manipulation Localization☆37Feb 26, 2026Updated last month
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆36Jul 14, 2025Updated 9 months ago
- ☆17Jul 11, 2025Updated 9 months ago
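- OpenDeepResearch: stay in control of the deep research process. It is a deep research agent with an interactive research process, addressing the uncontrollable black-box nature of traditional deep research and the difficulty of correcting skewed results. Users can step in at any point to adjust the strategy, reducing drift and rework, and ultimately generate a high-quality research report, saving…☆47Feb 26, 2026Updated last month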
- 🎯 A general-purpose protocol stack analysis and debugging tool based on eBPF 🧰☆1,174Apr 3, 2026Updated 2 weeks ago
- Emotion Detection in Audio Files - Speech & Songs. A deep learning model that detects 8 different kinds of emotions - neutral, calm, happ…☆23Apr 3, 2022Updated 4 years ago
- ☆54Jan 30, 2026Updated 2 months ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Jun 10, 2021Updated 4 years ago
- ☆18Apr 7, 2025Updated last year
- A FastAPI-based API for generating text embedding vectors. Handles Embedding + Rerank models and is compatible with the OpenAI and SiliconFlow formats☆26Nov 29, 2025Updated 4 months ago
- This repository contains the materials for the SUSTech CS323 Compilers course, including lecture and lab slides, project code, etc. Feel…☆15Oct 27, 2023Updated 2 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆17Nov 21, 2025Updated 4 months ago
- [ICLR 26] TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in f…☆595Nov 24, 2025Updated 4 months ago
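- KSP Router (a routing framework for Android implemented with KSP; ARouter projects can be migrated quickly to KSP Router)☆225Jul 17, 2025Updated 9 months ago
- Courseware and personal notes for some courses in the Department of Computer Science at Southern University of Science and Technology (SUSTech)☆16Aug 4, 2022Updated 3 years ago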
- ☆11May 27, 2023Updated 2 years ago
- A literature repository and Chinese-language beginner's guide for the LLM field. An introduction to some basic concepts in Large Language Models (LLMs).☆23Jun 21, 2023Updated 2 years ago
- Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".☆131Mar 1, 2026Updated last month
- A true AI agent for pixel-perfect web cloning. Multi-agent architecture built on Claude Agent SDK with 40+ specialized tools. Clones from…☆257Feb 17, 2026Updated 2 months ago
- Official code for "A Normalized Gaussian Wasserstein Distance for Tiny Object Detection"☆515Jul 20, 2025Updated 8 months ago
- ☆14Aug 14, 2014Updated 11 years ago
- Pytorch Implementation of INR-based codec RECOMBINER (Robust and Enhanced Compression with Bayesian Implicit Neural Representations)☆11Mar 9, 2024Updated 2 years ago
- Final project for Computer Organization, Spring 2022 semester, Southern University of Science and Technology (SUSTech), 125/100☆11Jun 23, 2022Updated 3 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- ☆11Aug 15, 2025Updated 8 months ago
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding☆21Aug 23, 2025Updated 7 months ago
- ☆24Nov 19, 2024Updated last year
- CS205 C/C++ Labs and Projects Code☆16Jan 3, 2022Updated 4 years ago
- ☆17Oct 29, 2024Updated last year
- [ICCV 2025] AdsQA: Towards Advertisement Video Understanding Arxiv: https://arxiv.org/abs/2509.08621☆34Oct 30, 2025Updated 5 months ago
- Python3 script to create Voronoi tessellations (mosaic pattern) on images☆10May 25, 2019Updated 6 years ago
- MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation☆44Nov 4, 2025Updated 5 months ago
- This is the official repo for our paper: "Generative Knowledge-Guided Retrieval System for Construction Disclosure Documents Reviewing"☆22Nov 17, 2025Updated 5 months ago
- A Dify tool plugin for encoding and decoding Base64 text and image files.☆14Dec 2, 2025Updated 4 months ago
- Universal memory layer for AI Agents. It provides scalable, extensible, and interoperable memory storage and retrieval to streamline AI a…☆4,130Updated this week