hustvl/MaskAdapter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hustvl/MaskAdapter)

hustvl / MaskAdapter

[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"

☆135

Alternatives and similar repositories for MaskAdapter

Users that are interested in MaskAdapter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hustvl / TBCM
View on GitHub
Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs
☆21Dec 16, 2025Updated 7 months ago
hustvl / GroundingSuite
View on GitHub
[ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding
☆77Jun 26, 2025Updated last year
hustvl / mmMamba
View on GitHub
The first decoder-only multimodal state space model
☆104May 19, 2025Updated last year
hustvl / OpenInst
View on GitHub
☆17Nov 17, 2023Updated 2 years ago
jiaosiyu1999 / MAFT-Plus
View on GitHub
☆60Sep 14, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
hustvl / MaTVLM
View on GitHub
☆62May 13, 2025Updated last year
hustvl / GaussTR
View on GitHub
[CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
☆217Jan 5, 2026Updated 6 months ago
hustvl / Spa3R
View on GitHub
Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
☆51Mar 25, 2026Updated 3 months ago
hustvl / ViG
View on GitHub
[AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention
☆116Jun 17, 2024Updated 2 years ago
hustvl / MolSight
View on GitHub
[AAAI 2026] MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learni…
☆27Dec 5, 2025Updated 7 months ago
hustvl / CircuitFormer
View on GitHub
[NeurIPS 2023] CircuitFormer: Circuit as Set of Points
☆38Nov 22, 2023Updated 2 years ago
hustvl / OmniMamba
View on GitHub
[ECCV 2026] OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models
☆126Apr 25, 2025Updated last year
hustvl / ViTGaze
View on GitHub
Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers"
☆62Mar 3, 2025Updated last year
mc-lan / ProxyCLIP
View on GitHub
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
☆120Mar 26, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cvlab-kaist / CAT-Seg
View on GitHub
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
☆385Apr 11, 2024Updated 2 years ago
hustvl / CoStudent
View on GitHub
☆14Nov 19, 2024Updated last year
hustvl / InfiniteVL
View on GitHub
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
☆110Jul 7, 2026Updated 2 weeks ago
Qinying-Liu / Awesome-Open-Vocabulary-Semantic-Segmentation
View on GitHub
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
☆891May 20, 2026Updated 2 months ago
hustvl / LENS
View on GitHub
[AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning
☆136Dec 3, 2025Updated 7 months ago
hustvl / VGT
View on GitHub
Visual Generation Tuning
☆101Apr 16, 2026Updated 3 months ago
Hanzy1996 / OpenSeg-R
View on GitHub
OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning
☆29May 24, 2025Updated last year
hustvl / EVF-SAM
View on GitHub
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
☆505Mar 17, 2025Updated last year
hustvl / ControlAR
View on GitHub
[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models
☆326Jun 30, 2026Updated 3 weeks ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
linsun449 / cliper.code
View on GitHub
This repo is the official pytorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V…
☆41Sep 10, 2025Updated 10 months ago
ruohaoguo / ovavss
View on GitHub
Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].
☆37Nov 2, 2024Updated last year
hustvl / Snap-Snap
View on GitHub
The repository of "Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds"
☆40Sep 1, 2025Updated 10 months ago
hustvl / Query6DoF
View on GitHub
Query6DoF: Learning Sparse Queries as Implicit Shape Prior for Category-Level 6DoF Pose Estimation
☆31Jan 4, 2024Updated 2 years ago
congvvc / HyperSeg
View on GitHub
[CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".
☆182Dec 13, 2024Updated last year
hustvl / WeakSAM
View on GitHub
[ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition
☆58Apr 8, 2025Updated last year
hustvl / Senna
View on GitHub
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
☆551Mar 15, 2026Updated 4 months ago
hustvl / MoDA
View on GitHub
An hardware-aware Efficient Implementation for "Mixture-of-Depths Attention".
☆274May 6, 2026Updated 2 months ago
yongliu20 / SCAN
View on GitHub
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆77Sep 23, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
hustvl / TOGS
View on GitHub
[IEEE JBHI] The official code of "TOGS: Gaussian Splatting with Temporal Opacity Offset for Real-Time 4D DSA Rendering"
☆33Sep 10, 2025Updated 10 months ago
hustvl / WeakCLIP
View on GitHub
[IJCV 2024]
☆21Nov 11, 2024Updated last year
congvvc / InstructSeg
View on GitHub
[ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"
☆56Feb 10, 2025Updated last year
hustvl / EVA-X
View on GitHub
[Nature Portfolio, npj DigitalMed] EVA-X: A foundation model for general chest X-ray analysis with self-supervised learning
☆100Jun 12, 2026Updated last month
chenxi52 / CMPF
View on GitHub
[IJCV 2026] Official implementation of the paper “CMPF: Harmonizing Cross-Model Prior Fusion for Open-Vocabulary Segmentation”
☆26Jun 15, 2025Updated last year
hustvl / Featurized-QueryRCNN
View on GitHub
Featurized Query R-CNN
☆46Jun 17, 2022Updated 4 years ago
nhw649 / EOV-Seg
View on GitHub
[AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"
☆40Dec 17, 2024Updated last year