congvvc / HyperSegLinks
[CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".
โ178Updated last year
Alternatives and similar repositories for HyperSeg
Users that are interested in HyperSeg are comparing it to the libraries listed below
Sorting:
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"โ268Updated last year
- [NeurIPS2025 Spotlight ๐ฅ ] Official implementation of ๐ธ "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Languโฆโ262Updated 2 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perceptionโ148Updated 6 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generationโ156Updated last month
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Modelโ200Updated last year
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"โ88Updated 2 weeks ago
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Modelsโ156Updated last year
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inferenceโ180Updated last year
- Vision Manus: Your versatile Visual AI assistantโ305Updated 2 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Predictionโ201Updated last year
- [TPAMI2025&CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.โ187Updated last year
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detectionโ188Updated 9 months ago
- โ101Updated 4 months ago
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-ofโฆโ180Updated 3 weeks ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anythingโ70Updated last year
- โ59Updated last year
- ๐ฎ UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)โ215Updated 2 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"โ120Updated 2 months ago
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"โ174Updated last year
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examplesโ65Updated last year
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"โ75Updated last year
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolutionโ57Updated 10 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentationโ54Updated last year
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Modelโ82Updated 5 months ago
- โ71Updated 2 years ago
- โ147Updated last year
- โ59Updated last year
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentatiโฆโ72Updated last year
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024โ87Updated last year
- Recognize Any Regionsโ122Updated last year