OpenRobotLab / OV_PARTS
[NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation
☆79Updated 8 months ago
Alternatives and similar repositories for OV_PARTS:
Users that are interested in OV_PARTS are comparing it to the libraries listed below
- Large-Vocabulary Video Instance Segmentation dataset☆81Updated 8 months ago
- Code Release for MaskCLIP (ICML 2023)☆63Updated last year
- Can 3D Vision-Language Models Truly Understand Natural Language?☆21Updated 11 months ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆41Updated 2 months ago
- ☆36Updated 11 months ago
- [NeurIPS 2024] Official PyTorch Implementation of PartCLIPSeg☆45Updated 2 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆68Updated 5 months ago
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"☆89Updated last year
- OVSegmentor, CVPR23☆58Updated 10 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆65Updated 5 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 5 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆47Updated 7 months ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆58Updated 11 months ago
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆41Updated 2 years ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆84Updated last year
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆64Updated 9 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆71Updated 4 months ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆93Updated 7 months ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆77Updated 8 months ago
- [ICCV 2023] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation☆36Updated last year
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆34Updated 8 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆147Updated 5 months ago
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆39Updated last year
- state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆29Updated 10 months ago
- [MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation☆34Updated 2 months ago
- ☆48Updated 5 months ago
- [ICCV 2023] PyTorch implementation of RandBox☆53Updated last year
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects☆40Updated 5 months ago
- [CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation☆99Updated 8 months ago
- [ECCV2022] A PyTorch implementation of the paper "Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embo…☆13Updated last year