minfenli / Segment-Anything-CLIPLinks
Using Segment-Anything and CLIP to generate pixel-aligned semantic features.
☆41Updated 2 years ago
Alternatives and similar repositories for Segment-Anything-CLIP
Users that are interested in Segment-Anything-CLIP are comparing it to the libraries listed below
Sorting:
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023☆48Updated last year
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆23Updated 3 months ago
- [ICLR'23] GOOD: Exploring Geometric Cues for Detecting Objects in an Open World☆39Updated 2 years ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆63Updated 2 years ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆67Updated last year
- ☆51Updated last year
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆121Updated last year
- Official implementation of the WACV 2024 paper CLIP-DIY☆33Updated last year
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models☆54Updated last year
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆70Updated 9 months ago
- ☆73Updated 7 months ago
- This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.☆14Updated 2 years ago
- The official implementation of “Segment Anything Model is a Good Teacher for Local Feature Learning”.☆120Updated 3 months ago
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding☆45Updated last year
- ☆47Updated last year
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆53Updated 6 months ago
- Code Release for MaskCLIP (ICML 2023)☆71Updated last year
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated last year
- 1-shot image segmentation using Stable Diffusion☆141Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer☆75Updated 10 months ago
- Open-vocabulary Semantic Segmentation☆33Updated last year
- [AAAI 2024] SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection☆42Updated last year
- Official implementation of "Can Language Understand Depth?"☆81Updated 2 years ago
- ☆43Updated 2 years ago
- Official implementation for [3DV 2024] `Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding`☆47Updated last year
- Official code for "Opening up Open World Tracking" (CVPR 2022)☆56Updated 2 years ago
- ROOT: VLM based System for Indoor Scene Understanding and Beyond☆32Updated 7 months ago
- ☆106Updated 2 years ago
- Point Could Mamba: Point Cloud Learning via State Space Model☆71Updated 9 months ago
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation☆19Updated last year