LG-AI-EXAONE / EXAONEPathLinks

☆17

Alternatives and similar repositories for EXAONEPath

Users that are interested in EXAONEPath are comparing it to the libraries listed below

Sorting:

jmhb0 / microvqa
[CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…
☆21Updated this week
lucidrains / AMIE-pytorch
Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind
☆65Updated 9 months ago
standardmodelbio / Llama3-Med
☆30Updated 8 months ago
EPFL-VILAB / fm-vision-evals
☆49Updated last week
rajesh-lab / symile
Symile is a flexible, architecture-agnostic contrastive loss that enables training modality-specific representations for any number of mo…
☆36Updated 3 months ago
passing2961 / DialogCC
Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…
☆13Updated last year
m1k2zoo / negbench
Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"
☆26Updated 2 months ago
EPFLiGHT / MultiModN
MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)
☆33Updated last year
HanSolo9682 / CounterCurate
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆18Updated last year
microsoft / x-reasoner
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆46Updated 2 months ago
samar-khanna / ExPLoRA
Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"
☆31Updated 9 months ago
top-yun / SPARK
A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.
☆18Updated 6 months ago
ethanbar11 / ssm_2d
More dimensions = More fun
☆22Updated 11 months ago
Google-Health / medsiglip
☆68Updated this week
eric-ai-lab / ComCLIP
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
☆35Updated 10 months ago
alicebizeul / pmae
Code for Principal Masked Autoencoders
☆27Updated 3 months ago
divyam3897 / I2M2
I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)
☆20Updated 8 months ago
kyegomez / BRAVE-ViT-Swarm
Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"
☆26Updated this week
UCDvision / NOLA
Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"
☆56Updated 10 months ago
google-research / silc
[ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation
☆44Updated 9 months ago
DCDmllm / HyperLLaVA
Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
☆28Updated last year
KAIST-Edlab / Study_Of_VL
KAIST medical VL research group
☆18Updated 6 months ago
kyegomez / PaLM2-VAdapter
Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…
☆16Updated 8 months ago
AmayaGS / MUSTANG
Multi-stain graph self attention multiple instance learning for histopathology Whole Slide Images - BMVC 2023
☆13Updated 4 months ago
Stanford-AIMI / RaVL
[NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models
☆23Updated 8 months ago
lapisrocks / DiscreteAdversarialDistillation
[NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"
☆12Updated last year
ml-jku / MIM-Refiner
A Contrastive Learning Boost from Intermediate Pre-Trained Representations
☆42Updated 9 months ago
facebookresearch / ViP-MAE
This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision
☆36Updated 2 years ago
prometheus-eval / prometheus-vision
[ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…
☆73Updated 9 months ago
SMILE-data / SMILE
SMILE: A Multimodal Dataset for Understanding Laughter
☆13Updated 2 years ago