[EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version.
☆122Apr 25, 2026Updated 2 weeks ago
Alternatives and similar repositories for vlm-lens
Users who are interested in vlm-lens are comparing it to the libraries listed below.
- Official Repository for BEVANet: Bilateral Efficient Visual Attention Network for Real-time Semantic Segmentation (ICIP 2025 Spotlight Or…☆22Oct 11, 2025Updated 6 months ago
- ☆58Nov 11, 2025Updated 5 months ago
- 🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆41Nov 21, 2025Updated 5 months ago
- ☆12Jun 5, 2024Updated last year
- ☆40Feb 4, 2026Updated 3 months ago
- ☆14Feb 21, 2024Updated 2 years ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆37Nov 13, 2024Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated this week
- [IJCAI 2023 workshop]Expanding dataset for 2D medical image segmentation using diffusion models☆15Feb 28, 2023Updated 3 years ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆29Jul 9, 2025Updated 10 months ago
- Image/Instance Retrieval using CLIP, A self supervised Learning Model☆29May 30, 2023Updated 2 years ago
- Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.…☆54Jan 27, 2026Updated 3 months ago
- ☆12Jun 20, 2023Updated 2 years ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆38Dec 5, 2024Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated 11 months ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆19Oct 6, 2025Updated 7 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Mar 18, 2026Updated last month
- ☆11Jan 13, 2022Updated 4 years ago
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆30Dec 2, 2025Updated 5 months ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆56Dec 28, 2025Updated 4 months ago
- [TPAMI 2026] Breaking Barriers, Localizing Saliency: A Large-scale Benchmark and Baseline for Condition-Constrained Salient Object Detect…☆30Dec 12, 2025Updated 4 months ago
- DINO-based perceptual losses and FDD feature extraction☆27Jan 7, 2026Updated 4 months ago
- ☆44May 9, 2025Updated last year
- ☆11Sep 1, 2024Updated last year
- First Latency-Aware Competitive LLM Agent Benchmark☆28Jun 3, 2025Updated 11 months ago
- Multimodal grounded language dataset☆11Dec 14, 2021Updated 4 years ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated last year
- A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its vari…☆145Oct 16, 2025Updated 6 months ago
- ChartSum is a large scale benchmark for automatic chart to text summarization☆11Jul 20, 2023Updated 2 years ago
- Unofficial PyTorch implementation of DALL-E 2 by OpenAI☆10Apr 6, 2022Updated 4 years ago
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆31Apr 30, 2025Updated last year
- ☆39Dec 18, 2025Updated 4 months ago
- ☆15Dec 16, 2023Updated 2 years ago
- [ICCV 2025] Identity Preserving 3D Head Stylization with Multiview Score Distillation☆16Jun 25, 2025Updated 10 months ago
- DataSciCamp — Data Science Challenge / Competition Deadlines☆15May 26, 2020Updated 5 years ago
- This is a community implementation for the paper EcoTTA: Memory-Efficient Continual Test-time Adaptation via Self-distilled Regularizatio…☆37Aug 4, 2023Updated 2 years ago
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆60Feb 4, 2026Updated 3 months ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing☆27Jan 27, 2026Updated 3 months ago
- ☆24Oct 30, 2025Updated 6 months ago