TACJu / Compositor
This repo contains the code for our paper "Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation".
☆16Updated 7 months ago
Alternatives and similar repositories for Compositor:
Users who are interested in Compositor are comparing it to the repositories listed below:
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆76Updated 7 months ago
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects☆37Updated 4 months ago
- Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"☆83Updated last year
- [IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)☆20Updated 2 years ago
- Large-Vocabulary Video Instance Segmentation dataset☆77Updated 6 months ago
- Official implementation of the CVPR'24 paper "Adaptive Slot Attention: Object Discovery with Dynamic Slot Number"☆31Updated this week
- [ICCV 2023] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation☆34Updated last year
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆32Updated 9 months ago
- This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).☆25Updated last year
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆74Updated 8 months ago
- [CVPR 2024] Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations☆15Updated last week
- PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"☆45Updated 4 months ago
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆42Updated 2 years ago
- [CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds☆53Updated 2 years ago
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"☆14Updated 3 months ago
- [MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation☆31Updated last month
- Can 3D Vision-Language Models Truly Understand Natural Language?☆21Updated 10 months ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Updated last year
- SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)☆32Updated 3 years ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆37Updated 3 weeks ago
- [CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.☆25Updated 8 months ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆25Updated 5 months ago
- ImageNet3D: Towards General-Purpose Object-Level 3D Understanding☆15Updated last month
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 4 months ago
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation☆20Updated last year