[ICCV2025] All in One: Visual-Description-Guided Unified Point Cloud Segmentation
☆28 · Jul 25, 2025 · Updated 7 months ago
Alternatives and similar repositories for VDG-Uni3DSeg
Users who are interested in VDG-Uni3DSeg compare it to the repositories listed below.
- See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction (NeurIPS 2025) ☆26 · Oct 21, 2025 · Updated 4 months ago
- ☆15 · Jun 14, 2025 · Updated 8 months ago
- [2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation ☆19 · Nov 8, 2025 · Updated 4 months ago
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6 ☆12 · Oct 16, 2024 · Updated last year
- OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning ☆27 · May 24, 2025 · Updated 9 months ago
- Chain_of_Thoughts_3D_Visual_Grounding ☆19 · Apr 20, 2024 · Updated last year
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding ☆20 · Dec 5, 2025 · Updated 3 months ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image ☆47 · Feb 29, 2024 · Updated 2 years ago
- ☆31 · Nov 6, 2024 · Updated last year
- ☆22 · Oct 21, 2024 · Updated last year
- [CVPR 2025] Official codes for the paper 'Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State … ☆36 · Apr 8, 2025 · Updated 11 months ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting ☆31 · Jun 5, 2025 · Updated 9 months ago
- [TCSVT'24] SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation ☆39 · May 27, 2025 · Updated 9 months ago
- ☆31 · Mar 5, 2025 · Updated last year
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024) ☆35 · Sep 9, 2024 · Updated last year
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc… ☆12 · Oct 14, 2024 · Updated last year
- Official implementation of "Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers" (NeurIPS 2025) ☆68 · Sep 23, 2025 · Updated 5 months ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation ☆41 · Jun 7, 2024 · Updated last year
- A Framework for Evaluating AI Agent Safety in Realistic Environments ☆30 · Oct 2, 2025 · Updated 5 months ago
- [ECCV 2024] VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement ☆36 · Jul 29, 2024 · Updated last year
- The repository of VG-Refiner paper ☆17 · Dec 9, 2025 · Updated 3 months ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking ☆13 · Apr 12, 2023 · Updated 2 years ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding ☆43 · Aug 18, 2025 · Updated 6 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023) ☆47 · Apr 28, 2023 · Updated 2 years ago
- [ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds ☆52 · Jan 10, 2025 · Updated last year
- ☆11 · Apr 28, 2023 · Updated 2 years ago
- ☆27 · Jan 9, 2026 · Updated 2 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models. ☆19 · Jul 10, 2025 · Updated 7 months ago
- ☆11 · Feb 7, 2025 · Updated last year
- ViViDex implementation under the SAPIEN simulator, ICRA 2025 ☆17 · Apr 9, 2025 · Updated 11 months ago
- [TCSVT'26] LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation ☆17 · Feb 22, 2026 · Updated 2 weeks ago
- [IROS 2025] EgoLoc: Zero-Shot Temporal Interaction Localization for Egocentric Videos ☆33 · Jan 13, 2026 · Updated last month
- A Python script to delete all comment and submission data from a given Reddit account. ☆11 · Jan 5, 2021 · Updated 5 years ago
- ☆10 · Apr 7, 2025 · Updated 11 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation ☆13 · Jun 17, 2024 · Updated last year
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025) ☆70 · May 2, 2025 · Updated 10 months ago
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024) ☆179 · Feb 27, 2026 · Updated last week
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection". ☆11 · Nov 13, 2024 · Updated last year
- Implementation of Hyena Hierarchy in JAX ☆10 · Apr 30, 2023 · Updated 2 years ago