bytepioneerX / s3motLinks
☆33Updated 11 months ago
Alternatives and similar repositories for s3mot
Users that are interested in s3mot are comparing it to the libraries listed below
Sorting:
- Toolkit for JRDB dataset☆43Updated last year
- [ICCV'25] 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection☆70Updated 3 weeks ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere☆113Updated last year
- [ICLR 2025 (Oral 📢) ] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet2…☆227Updated 7 months ago
- Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion (ICCV 2025)☆71Updated last month
- ☆45Updated 4 months ago
- [CVPR 25] Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation☆226Updated last month
- [CVPR 2025] RelationField: Relate Anything in Radiance Fields☆83Updated 7 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆73Updated last month
- [ICCV 2025] Detect Anything 3D in the Wild☆219Updated 4 months ago
- Source Code for "Map It Anywhere (MIA): Empowering Bird’s Eye View Mapping using Large-scale Public Data"☆92Updated 11 months ago
- [CVPR2024] Open-world Semantic Segmentation Including Class Similarity☆78Updated 7 months ago
- Mask4Former: Mask Transformer for 4D Panoptic Segmentation☆67Updated 5 months ago
- Code for QuantVGGT: Quantized Visual Geometry Grounded Transformer☆75Updated 2 weeks ago
- ☆46Updated 6 months ago
- Official Repo For "RockTrack"☆37Updated 7 months ago
- [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)☆120Updated last month
- ☆97Updated 7 months ago
- [RA-L25/ICRA26] HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking☆30Updated 3 months ago
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates…☆39Updated last year
- Offcial code for the ECCV2024 paper "Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities"☆25Updated last year
- [CVPR'25] LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes☆64Updated 4 months ago
- ☆53Updated last year
- Segment This Thing is an efficient image segmentation models that uses a biologically-inspired foveated tokenization to reduce inference …☆53Updated 4 months ago
- The official Implementation of "VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection" [CVPR 202…☆40Updated last year
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆119Updated 5 months ago
- [NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO☆119Updated last month
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆49Updated 3 months ago
- [ICCV 2025] 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.☆83Updated 3 months ago
- [ICCV 2025] SAM4D: Segment Anything in Camera and LiDAR Streams☆191Updated last month