claws-lab/projection-in-MLLMs

Code and data for the ACL 2024 paper 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'

☆18

Alternatives and similar repositories for projection-in-MLLMs

Users who are interested in projection-in-MLLMs are comparing it to the repositories listed below.

whongzhong / MMHalSnowball
Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…
☆18 · Aug 12, 2024 · Updated last year
FreedomIntelligence / TRIM
We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…
☆22 · Jan 11, 2026 · Updated 6 months ago
WillDreamer / Awesome-MLLM-Reasoning
Recent Advances on MLLM's Reasoning Ability
☆26 · Apr 11, 2025 · Updated last year
ytaek-oh / fsc-clip
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆23 · Oct 8, 2024 · Updated last year
UARK-AICV / FG-CXR
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12 · Jul 28, 2025 · Updated 11 months ago
claws-lab / XLingEval
Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"
☆19 · May 14, 2026 · Updated 2 months ago
CUHK-AIM-Group / MCPL
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13 · Apr 17, 2024 · Updated 2 years ago
guanjinquan / CXRTrek
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning. Releases the dataset and the model weights.
☆13 · May 26, 2025 · Updated last year
rajpurkarlab / ReXKG
☆17 · Sep 23, 2024 · Updated last year
Richar-Du / Virgo
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆20 · May 27, 2025 · Updated last year
sangminwoo / awesome-token-redundancy-reduction
😎 Awesome papers on token redundancy reduction
☆14 · Mar 12, 2025 · Updated last year
umd-huang-lab / Mementos
☆32 · Feb 8, 2024 · Updated 2 years ago
zwq2018 / Multi-modal-Self-instruct
The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…
☆85 · Jan 27, 2025 · Updated last year
THUNLP-MT / ActiView
☆11 · Dec 20, 2024 · Updated last year
cwangrun / CheXficient
CheXficient
☆15 · Jun 28, 2026 · Updated 3 weeks ago
PaulCCCCCCH / Multimodal-Categorization-of-Crisis-Events-in-Social-Media
An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Media
☆17 · Dec 8, 2021 · Updated 4 years ago
RupertLuo / VoCoT
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
☆79 · Jul 13, 2024 · Updated 2 years ago
wjhou / ICon
[EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation
☆19 · Dec 11, 2024 · Updated last year
jonathan-roberts1 / charting-new-territories
Accompanying repo for CVPRW'24: Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs
☆27 · May 24, 2025 · Updated last year
StanfordMIMI / MedVAL
Toward Expert-Level Medical Text Validation with Language Models
☆18 · Oct 23, 2025 · Updated 9 months ago
HKUSTGZ-ML4Health-Lab / Med-Scout
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
☆16 · Feb 8, 2026 · Updated 5 months ago
borioda / gnss_jamming_demo
Demonstration of the effects of interference mitigation techniques on GNSS signal acquisition.
☆17 · Apr 21, 2021 · Updated 5 years ago
yangyan22 / Medical-Report-Generation-TriNet
Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation
☆18 · Nov 13, 2025 · Updated 8 months ago
chancharikmitra / CCoT
[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"
☆142 · Jun 20, 2024 · Updated 2 years ago
SpeechEE / SpeechEE
☆11 · Aug 20, 2025 · Updated 11 months ago
SLDGroup / LBP-WHT
☆13 · Apr 27, 2024 · Updated 2 years ago
UCSB-AI / ProbMed
Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…
☆25 · May 12, 2026 · Updated 2 months ago
MedHK23 / IMT-CXR
☆20 · Jan 3, 2025 · Updated last year
ASGMVLP / ASGMVLP_CODE
The repo of ASGMVLP
☆19 · Jan 16, 2026 · Updated 6 months ago
wenhuang2000 / VHTest
VHTest
☆16 · Oct 31, 2024 · Updated last year
wjpoom / SPEC
[CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"
☆52 · Jun 16, 2025 · Updated last year
haizhongzheng / LTE
☆13 · Oct 13, 2025 · Updated 9 months ago
abachaa / VQA-Med-2021
VQA-Med 2021
☆24 · May 13, 2026 · Updated 2 months ago
zzma2 / medical-llm-reasoning-survey
A curated list of medical reasoning research on large language models, organized by modality, technique, application, and benchmark.
☆19 · Oct 17, 2025 · Updated 9 months ago
rayruizhiliao / mutual_info_img_txt
Joint learning of images and text via maximization of mutual information
☆19 · Dec 14, 2021 · Updated 4 years ago
starmpcc / REMed
REMed: Retrieval-Enhanced Medical prediction model
☆24 · Jan 8, 2025 · Updated last year
huskydoge / CS2612-Programming-Languages-and-Compilers
SJTU | CS 2612, Programming Languages and Compilers, Fall 2023
☆13 · Jan 9, 2024 · Updated 2 years ago
hyhuang00 / moe_inference
Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".
☆19 · Oct 30, 2024 · Updated last year
YtongXie / X-RGen
[ACCV2024 (Oral)] Official PyTorch implementation of X-RGen
☆18 · Jan 20, 2025 · Updated last year