QuentinFitteRey/VLMSAM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QuentinFitteRey/VLMSAM)

QuentinFitteRey / VLMSAM

Qwen-SAM is a reasoning-based segmentation model that integrates Qwen 2.5 VL 7B with the Segment Anything Model (SAM), enabling fine-grained visual segmentation from complex text prompts using LoRA fine-tuning.

☆32

Alternatives and similar repositories for VLMSAM

Users that are interested in VLMSAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PolyU-ChenLab / UniPixel
View on GitHub
🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)
☆247Jan 4, 2026Updated 6 months ago
echo840 / LIRA
View on GitHub
[ICCV 2025] LIRA
☆22Nov 25, 2025Updated 7 months ago
hustvl / LENS
View on GitHub
[AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning
☆136Dec 3, 2025Updated 7 months ago
Hui-design / R1-Video-fixbug
View on GitHub
[Blog 1] Recording a bug of grpo_trainer in some R1 projects
☆23Feb 23, 2025Updated last year
baoxiaoyi / CoReS
View on GitHub
code for the paper "CoReS: Orchestrating the Dance of Reasoning and Segmentation"
☆23Nov 24, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
JIA-Lab-research / VisionReasoner
View on GitHub
[ICLR 2026] VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning
☆348Feb 9, 2026Updated 5 months ago
Dongdong-d / GroundingDino-Finetuning
View on GitHub
根据Open-GroundingDino代码训练自己的数据集，记录复现过程
☆43Jan 22, 2025Updated last year
zhangye-zoe / PathMR
View on GitHub
PathMR: Multimodal Visual Reasoning for Interpretable Pathology Analysis
☆15Aug 27, 2025Updated 10 months ago
Rooholla-KhorramBakht / FR3Py
View on GitHub
A unified Python simulation and hardware communication environment for Franka FR3 robots.
☆23Aug 15, 2024Updated last year
dario-pedro / uav-collision-avoidance
View on GitHub
Machine Learning Algorithms for drones to Avoid Dynamic Objects
☆13Jul 28, 2020Updated 5 years ago
mc-lan / Awesome-MLLM-Segmentation
View on GitHub
A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of…
☆230Jun 28, 2026Updated 3 weeks ago
MB-Team-THI / conditioned-vehicle-motion-diffusion
View on GitHub
☆18Feb 26, 2026Updated 4 months ago
wanghao9610 / X-SAM
View on GitHub
[AAAI2026] X-SAM: From Segment Anything to Any Segmentation
☆383Jul 14, 2026Updated last week
cvpr-vand / challenge
View on GitHub
Technical Challenge Repository for Visual Anomaly Detection Workshop (VAND) at CVPR
☆14Jul 21, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
wangyu-ustc / LargeScaleWashing
View on GitHub
The official implementation of the paper "Large Scale Knowledge Washing"
☆10Jun 12, 2024Updated 2 years ago
showlab / PANDA
View on GitHub
[NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer
☆33Oct 2, 2025Updated 9 months ago
suikei-wang / RESAnything
View on GitHub
[NeurIPS 2025] RESAnything: Attribute Prompting for Arbitrary Referring Segmentation
☆19May 26, 2026Updated last month
congvvc / LaSagnA
View on GitHub
Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".
☆63Apr 29, 2024Updated 2 years ago
Hectormxy / OP-SAM
View on GitHub
The official implementation of ICCV 25 OP-SAM "One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Ite…
☆15Jul 9, 2025Updated last year
clownrat6 / OpenVIS
View on GitHub
[AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
☆26Dec 30, 2024Updated last year
zhengye1995 / DCIC22-Cow
View on GitHub
DCIC22数字中国22-牛只图像分割竞赛第四名方案
☆14Jul 18, 2022Updated 4 years ago
rui-qian / UGround
View on GitHub
Rui Qian, Xin Yin, Chuanhang Deng, et al.: UGround: Towards Unified Visual Grounding with Unrolled Transformers (ICML 2026)
☆29Jun 18, 2026Updated last month
BBQtime / deformable-convolution-network-DCN-for-head-and-neck-tumor-segmentation
View on GitHub
3D deformable convolution network(DCN) for head and neck tumor segmentation
☆11May 4, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
hexdjx / VisTrack
View on GitHub
A series of improved methods are used for visual tracking
☆10Nov 29, 2025Updated 7 months ago
FYYDCC / IVT-LR
View on GitHub
Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”
☆18Jan 27, 2026Updated 5 months ago
bruno686 / VisPlay
View on GitHub
[CVPR'26] VisPlay: Self-Evolving Vision-Language Models
☆63Feb 25, 2026Updated 4 months ago
ragavsachdeva / The-Change-You-Want-to-See
View on GitHub
The official implementation of the paper The Change You Want to See (WACV 2023).
☆70Mar 27, 2024Updated 2 years ago
berkeley-hipie / segllm
View on GitHub
Code release for "SegLLM: Multi-round Reasoning Segmentation"
☆129Feb 20, 2025Updated last year
Brickzhuantou / CalibDepth
View on GitHub
☆22Aug 30, 2023Updated 2 years ago
vincekurtz / rnn_collvoid
View on GitHub
Dynamic collision avoidance using LSTM to predict time-dependent obstacle behaviors
☆23Nov 14, 2018Updated 7 years ago
gym487 / SAR_exp
View on GitHub
Experiments about Synthetic Aperture Radar
☆15Aug 29, 2019Updated 6 years ago
xiaozhen228 / DictAS
View on GitHub
(ICCV 2025) DictAS: A Framework for Class-Generalizable Few-Shot Anomaly Segmentation via Dictionary Lookup
☆58Dec 13, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
see-say-segment / sesame
View on GitHub
🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"
☆47Jun 16, 2024Updated 2 years ago
jiangmengli / MetaMask
View on GitHub
☆13Sep 16, 2022Updated 3 years ago
logan-0623 / PG-SAM
View on GitHub
Efficient Semantic Fine-grained Prior Generation and Refinement Decoder Based on SAM for Improved Multi-organ Segmentation
☆22Mar 26, 2025Updated last year
chase6305 / 7DofSRSKinematics
View on GitHub
Kinematics analytical solution and inverse solution for KUKA IIWA 7DOF robot.
☆15Jan 13, 2025Updated last year
ktonal / mimikit
View on GitHub
Music Modeling Kit
☆22Jan 10, 2025Updated last year
2209520576 / Infrared-Dim-Target-Detection-Based-on-Human-Visual-Mechanism
View on GitHub
☆12Aug 22, 2021Updated 4 years ago
jcwang0602 / MLLMSeg
View on GitHub
MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
☆57Jun 12, 2026Updated last month