aminebdj / 3D-OWIS
[NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of training.
☆67Updated last year
Alternatives and similar repositories for 3D-OWIS:
Users that are interested in 3D-OWIS are comparing it to the libraries listed below
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆98Updated 3 weeks ago
- Unifying 2D and 3D Vision-Language Understanding☆59Updated this week
- Official Implementation of DINO-Foresight: Looking into the Future with DINO☆49Updated last month
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)☆145Updated last month
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆73Updated 8 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆70Updated last week
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆45Updated last month
- SceneFun3D ToolKit☆130Updated 3 weeks ago
- Improving 3D Large Language Model via Robust Instruction Tuning☆53Updated last month
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding☆44Updated last year
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆86Updated 2 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆38Updated 4 months ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆43Updated 2 months ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Updated 3 weeks ago
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆23Updated 6 months ago
- [ECCV2024] Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding☆115Updated 9 months ago
- ☆17Updated 2 weeks ago
- ☆81Updated 3 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆75Updated 4 months ago
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023☆48Updated 9 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆106Updated last month
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆67Updated last year
- This is the official implementation for our paper;"LAR:Look Around and Refer".☆30Updated 2 years ago
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation☆17Updated last year
- ☆37Updated last year
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆37Updated 10 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆299Updated 9 months ago
- [CVPR 2025] Official code and models for Encoder-only Mask Transformer (EoMT).☆97Updated last week
- 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.☆55Updated 3 months ago
- ☆54Updated last week