aminebdj / 3D-OWISLinks
[NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of training.
☆67Updated last year
Alternatives and similar repositories for 3D-OWIS
Users that are interested in 3D-OWIS are comparing it to the libraries listed below
Sorting:
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆53Updated 6 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆114Updated 6 months ago
- ☆41Updated last year
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆95Updated 3 months ago
- Official Implementation of DINO-Foresight: Looking into the Future with DINO☆66Updated 3 weeks ago
- Official code for "JAFAR: Jack up Any Feature at Any Resolution"☆161Updated 2 weeks ago
- Improving 3D Large Language Model via Robust Instruction Tuning☆62Updated 6 months ago
- ☆91Updated 8 months ago
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆23Updated 3 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆41Updated 9 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆121Updated last year
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding☆45Updated last year
- Unifying 2D and 3D Vision-Language Understanding☆104Updated last month
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆72Updated last year
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆70Updated this week
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆104Updated 5 months ago
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆26Updated 11 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆95Updated 7 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆61Updated 11 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆31Updated 3 months ago
- [ICLR 2024] AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation☆122Updated 5 months ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆51Updated 2 months ago
- OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models☆65Updated last month
- [ICCV 2025] 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.☆77Updated last month
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)☆162Updated 3 months ago
- [ECCV2024] Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding☆124Updated last year
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆46Updated last year
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆36Updated last year
- [ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation☆39Updated last week
- Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆82Updated last month