congvvc / HyperSegLinks
[CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".
โ175Updated 11 months ago
Alternatives and similar repositories for HyperSeg
Users that are interested in HyperSeg are comparing it to the libraries listed below
Sorting:
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"โ259Updated 10 months ago
- [NeurIPS2025 Spotlight ๐ฅ ] Official implementation of ๐ธ "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Languโฆโ244Updated last week
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generationโ150Updated last month
- Vision Manus: Your versatile Visual AI assistantโ293Updated last month
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perceptionโ142Updated 5 months ago
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-ofโฆโ160Updated 2 weeks ago
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Modelsโ150Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Modelโ194Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Predictionโ195Updated last year
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"โ86Updated 7 months ago
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detectionโ184Updated 7 months ago
- โ59Updated last year
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anythingโ69Updated last year
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"โ111Updated 3 weeks ago
- [NeurIPS 2025 Workshop] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"โ52Updated 4 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examplesโ63Updated last year
- โ94Updated 3 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inferenceโ174Updated last year
- [TPAMI2025&CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.โ183Updated last year
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Modelโ76Updated 3 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"โ75Updated last year
- Recognize Any Regionsโ121Updated 10 months ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Futureโ204Updated 7 months ago
- [CVPR 2024] PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding.โ238Updated 9 months ago
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"โ162Updated last year
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolutionโ55Updated 8 months ago
- โ128Updated last year
- [AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Videoโฆโ89Updated 10 months ago
- โ69Updated last year
- ๐ฎ UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)โ174Updated 3 weeks ago