congvvc / HyperSeg
Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".
☆32Updated this week
Alternatives and similar repositories for HyperSeg:
Users that are interested in HyperSeg are comparing it to the libraries listed below
- OVSegmentor, CVPR23☆55Updated 7 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆45Updated 5 months ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆90Updated 5 months ago
- DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution☆39Updated last month
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆35Updated last year
- ☆37Updated 3 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆64Updated 2 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆30Updated 6 months ago
- ☆23Updated 2 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆38Updated last month
- ☆21Updated 4 months ago
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆24Updated 3 weeks ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆42Updated 4 months ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆80Updated 9 months ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆70Updated 3 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples☆39Updated last month
- ☆29Updated 8 months ago
- ☆58Updated last year
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆57Updated 4 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆65Updated 2 months ago
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation☆20Updated 11 months ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆27Updated 8 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆31Updated 6 months ago
- Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning☆109Updated 3 months ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆29Updated last year
- IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆26Updated 3 weeks ago
- Text4Seg: Reimagining Image Segmentation as Text Generation☆25Updated 2 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆35Updated last week