rajesh-lab / symile
Symile is a flexible, architecture-agnostic contrastive loss that enables training modality-specific representations for any number of modalities.
★32 · Updated last month
Alternatives and similar repositories for symile:
Users who are interested in symile are comparing it to the repositories listed below.
- [NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy ★66 · Updated last year
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024) ★19 · Updated 5 months ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"… ★20 · Updated last month
- [NeurIPS 2023, ICMI 2023] Quantifying & Modeling Multimodal Interactions ★72 · Updated 5 months ago
- More dimensions = More fun ★22 · Updated 8 months ago
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants ★29 · Updated 3 months ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding ★49 · Updated 3 weeks ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models". ★38 · Updated 5 months ago
- "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA" ★16 · Updated 2 months ago
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023) ★33 · Updated last year
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models ★96 · Updated 8 months ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch ★97 · Updated last year
- Holistic evaluation of multimodal foundation models ★47 · Updated 8 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ★54 · Updated 4 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models ★75 · Updated 7 months ago
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https… ★18 · Updated 9 months ago
- ★45 · Updated 3 months ago
- Official repo for MindGPT ★31 · Updated 9 months ago
- ★43 · Updated 6 months ago
- ★43 · Updated this week
- Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks ★28 · Updated 2 years ago
- ★11 · Updated 3 weeks ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning ★36 · Updated last week
- Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation ★103 · Updated 2 months ago
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP ★26 · Updated last month
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind ★60 · Updated 7 months ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction" ★17 · Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning" ★81 · Updated last year
- ★41 · Updated 9 months ago
- ★23 · Updated 5 months ago