weihao1115/mm-sam

weihao1115 / mm-sam

The official implementation of "Segment Anything with Multiple Modalities".

☆113

Alternatives and similar repositories for mm-sam

Users who are interested in mm-sam are comparing it to the libraries listed below.

weihao1115 / cat-sam
[ECCV 2024 Oral] The official implementation of "CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model".
☆137 · Sep 1, 2024 · Updated last year
weihao1115 / MMLU-ProX
[EMNLP 2025 Main] The official repo of MMLU-ProX benchmark.
☆29 · Aug 26, 2025 · Updated 11 months ago
xing0047 / cca-llava
[NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention
☆67 · Aug 30, 2025 · Updated 10 months ago
xiaoaoran / awesome-RSFMs
Official repo for "Foundation Models for Remote Sensing and Earth Observation: A Survey"
☆51 · Nov 25, 2024 · Updated last year
weihao1115 / dynamicvl
[NeurIPS 2025] The official repo of "DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding".
☆30 · Feb 7, 2026 · Updated 5 months ago
ChenHongruixuan / AnyDisasterMapping
Code Repo for Earth Observation for Disaster Mapping: Benchmarks, Methods, Challenges and Future Perspectives
☆66 · May 15, 2026 · Updated 2 months ago
Hengwei-Zhao96 / NcPU
Noisy-Pair Robust Representation Alignment for Positive-Unlabeled Learning (ICLR 2026)
☆15 · Feb 24, 2026 · Updated 5 months ago
xiaoyan07 / SAM_MLoRA
☆23 · May 28, 2025 · Updated last year
xing0047 / rewrite
[NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
☆21 · Jan 3, 2024 · Updated 2 years ago
91097luke / phileo-bench
Repo for testing foundation models
☆12 · Jan 19, 2024 · Updated 2 years ago
chenxi52 / CMPF
[IJCV 2026] Official implementation of the paper "CMPF: Harmonizing Cross-Model Prior Fusion for Open-Vocabulary Segmentation"
☆26 · Jun 15, 2025 · Updated last year
cliffbb / OEM-Fewshot-Challenge
☆23 · Jan 22, 2025 · Updated last year
niejiahao1998 / IFA
☆35 · Jul 10, 2024 · Updated 2 years ago
ZheningHuang / NHA12D-Crack-Detection-Dataset-and-Comparison-Study
☆13 · Apr 10, 2022 · Updated 4 years ago
mt-cly / SimCMF
SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality
☆34 · Nov 25, 2024 · Updated last year
tianrun-chen / SAM-Adapter-PyTorch
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
☆1,544 · May 17, 2026 · Updated 2 months ago
PolyU-ChenLab / ETBench
👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)
☆74 · Jan 20, 2025 · Updated last year
U-AMC / RA-LLO
RA-LLO: Robust Adaptive Legged-LiDAR Odometry with Gaussian Process Motion Prior
☆16 · Jul 5, 2025 · Updated last year
m-arda-aydn / ITACLIP
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025]
☆24 · Jan 31, 2026 · Updated 5 months ago
likyoo / SemiCD-VL
The PyTorch implementation for "SemiCD-VL: Visual-Language Model Guidance Makes Better Semi-supervised Change Detector"
☆54 · Dec 19, 2024 · Updated last year
Junjue-Wang / EarthVQA
[AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering
☆155 · Jan 26, 2026 · Updated 6 months ago
ytaek-oh / fsc-clip
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆23 · Oct 8, 2024 · Updated last year
earth-insights / awesome-layout-to-image
An up-to-date & curated list of awesome layout to image papers, methods & resources.
☆13 · Jun 28, 2024 · Updated 2 years ago
sparolab / Joint_ID
Joint-ID: Transformer-based Joint Image Enhancement and Depth Estimation for Underwater Environments
☆27 · Mar 11, 2024 · Updated 2 years ago
ShawnAn-WHU / PIS
☆17 · Mar 25, 2024 · Updated 2 years ago
Junjue-Wang / LoveNAS
[ISPRS 2024] LoveNAS: Towards Multi-Scene Land-Cover Mapping via Hierarchical Searching Adaptive Network
☆33 · Dec 1, 2024 · Updated last year
MiliLab / Text-Before-Vision
[ICML 2026] Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding
☆16 · Mar 13, 2026 · Updated 4 months ago
earth-insights / ClassTrans
2nd Place in the OpenEarthMap Land Cover Mapping Few-Shot Challenge
☆18 · Apr 22, 2024 · Updated 2 years ago
ChenHongruixuan / BRIGHT
[ESSD 2025 & IEEE DFC 2025 & CVPRW 2026] Bright: A globally distributed multimodal VHR dataset for all-weather disaster response
☆247 · Updated this week
liliu-avril / Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
☆1,215 · Updated this week
MitsuiChen14 / DGTRS
☆32 · Jun 10, 2026 · Updated last month
yliu-cs / PiTe
[ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model
☆17 · Feb 13, 2025 · Updated last year
Dayan-Guan / DA-VSN
Code for "Domain Adaptive Video Segmentation via Temporal Consistency Regularization" in ICCV 2021
☆42 · Jul 5, 2022 · Updated 4 years ago
RunyuFan / UisNet-TGRS-2022
Code for TGRS 2022 paper "Fine-scale Urban Informal Settlements Mapping by Fusing Remote Sensing Images and Building Data via a Transform…
☆13 · Apr 10, 2025 · Updated last year
HarborYuan / ovsam
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
☆1,031 · Aug 4, 2025 · Updated 11 months ago
mmendiet / GFM
☆79 · Nov 1, 2023 · Updated 2 years ago
Incalos / Image-Capture-With-RealSense
This project involves using Intel Realsense to capture RGB images, depth images, and pseudo colored depth images, and is suitable for cre…
☆38 · May 26, 2023 · Updated 3 years ago
palmdong / SMMCL
[WACV 2024] Understanding Dark Scenes by Contrasting Multi-Modal Observations
☆13 · Dec 1, 2025 · Updated 7 months ago
AV-Odyssey / AV-Odyssey
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31 · Dec 23, 2024 · Updated last year