cvenhoff / vlm-mappingLinks
☆16Updated 2 months ago
Alternatives and similar repositories for vlm-mapping
Users that are interested in vlm-mapping are comparing it to the libraries listed below
Sorting:
- A holistic benchmark for LLM abstention☆48Updated last week
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆149Updated last month
- ☆34Updated 7 months ago
- ☆20Updated last month
- ☆49Updated 2 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆47Updated 3 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆22Updated 6 months ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆80Updated 7 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆35Updated 2 weeks ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆73Updated 9 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆70Updated 3 months ago
- This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen,…☆51Updated 8 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29Updated 3 months ago
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.☆39Updated 2 months ago
- A repo for open research on building large reasoning models☆94Updated this week
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆86Updated 2 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆63Updated 6 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆64Updated last year
- ☆40Updated 3 months ago
- Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆214Updated 2 weeks ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆48Updated 4 months ago
- Geometric-Mean Policy Optimization☆68Updated last month
- Reinforcing General Reasoning without Verifiers☆80Updated 2 months ago
- ☆212Updated 6 months ago
- Official Repo for RuleReasoner.☆26Updated 2 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆104Updated 2 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆110Updated 11 months ago
- ☆19Updated 6 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆30Updated 4 months ago
- ☆47Updated 6 months ago