[EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version.
☆117Mar 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for vlm-lens
Users that are interested in vlm-lens are comparing it to the libraries listed below.
- ☆10Nov 18, 2024Updated last year
- ☆37Feb 4, 2026Updated last month
- 🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆40Nov 21, 2025Updated 4 months ago
- Image/Instance Retrieval using CLIP, A self supervised Learning Model☆29May 30, 2023Updated 2 years ago
- Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.…☆47Jan 27, 2026Updated 2 months ago
- ☆57Jun 23, 2025Updated 9 months ago
- ☆12Jun 20, 2023Updated 2 years ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆53Dec 28, 2025Updated 3 months ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆38Dec 5, 2024Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated 9 months ago
- Generation of Space Boundaries based on IFC files for Building Simulation☆16Jan 24, 2023Updated 3 years ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆19Oct 6, 2025Updated 5 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Mar 18, 2026Updated last week
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆19Nov 28, 2021Updated 4 years ago
- DINO-based perceptual losses and FDD feature extraction☆26Jan 7, 2026Updated 2 months ago
- ☆43Jan 27, 2026Updated 2 months ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated 10 months ago
- ☆14Jan 9, 2026Updated 2 months ago
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆30Apr 30, 2025Updated 11 months ago
- Static code analysis for VE280 projects☆17Jul 12, 2023Updated 2 years ago
- This is a community implementation for the paper EcoTTA: Memory-Efficient Continual Test-time Adaptation via Self-distilled Regularizatio…☆37Aug 4, 2023Updated 2 years ago
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment☆11Dec 17, 2025Updated 3 months ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing☆27Jan 27, 2026Updated 2 months ago
- Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks☆38Nov 27, 2025Updated 4 months ago
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆57Feb 4, 2026Updated last month
- ☆34Apr 8, 2025Updated 11 months ago
- ☆28Dec 17, 2025Updated 3 months ago
- Does patch ordering affect context-limited vision transformers?☆17Oct 10, 2025Updated 5 months ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆16Mar 31, 2025Updated 11 months ago
- Retargeting of whole-body human motion to humanoid robots for dexterous manipulation of articulated objects.☆26Jan 28, 2026Updated 2 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.☆61Aug 19, 2025Updated 7 months ago
- ☆22Sep 16, 2025Updated 6 months ago
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Text-guided 3D texture generation using training-free multi-diffusion in UV space.☆14Apr 7, 2025Updated 11 months ago
- ☆15Jul 18, 2022Updated 3 years ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 4 months ago
- [ECCV 2024] Teach CLIP to Develop a Number Sense for Ordinal Regression☆19Apr 1, 2025Updated 11 months ago
- Source code of paper "Systematic Assessment of Factual Knowledge in Large Language Models" - EMNLP Findings 2023☆17Mar 17, 2026Updated last week
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago