ROOT: VLM based System for Indoor Scene Understanding and Beyond
☆40Jan 22, 2025Updated last year
Alternatives and similar repositories for ROOT
Users that are interested in ROOT are comparing it to the libraries listed below
- Embodied Instruction Following in Unknown Environments☆17Dec 8, 2025Updated 2 months ago
- Code for [AAAI 2026] AffordDex: Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors☆26Dec 26, 2025Updated 2 months ago
- Introduces a novel Video Trimming (VT) task and proposes an agent-based approach (AVT) for detecting wasted footage, selecting valuable se…☆23Jan 20, 2025Updated last year
- Extreme Rotation Estimation using Dense Correlation Volumes☆45Jan 10, 2023Updated 3 years ago
- Official repository for gathering data of Revisit Human-Scene Interaction via Space Occupancy (ECCV 2024).☆28Sep 29, 2024Updated last year
- CAR: Class-aware Regularizations for Semantic Segmentation (ECCV-2022)☆30Oct 26, 2022Updated 3 years ago
- ☆27Sep 8, 2021Updated 4 years ago
- ☆30Apr 21, 2022Updated 3 years ago
- FSD Tesla Open-source. Real-Time Environment Reconstruction System for Autonomous Vehicles☆16Jan 8, 2026Updated last month
- ☆50Feb 5, 2026Updated 3 weeks ago
- Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)☆37May 16, 2022Updated 3 years ago
- ☆23Oct 14, 2025Updated 4 months ago
- ☆42Jul 9, 2025Updated 7 months ago
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆38Mar 21, 2025Updated 11 months ago
- [IROS 2024] Incrementally Building Room-Scale Language-Embedded Gaussian Splats (LEGS) with a Mobile Robot☆58May 7, 2025Updated 9 months ago
- [CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation☆32Oct 19, 2023Updated 2 years ago
- [ITSC-2023] HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection☆40Aug 11, 2023Updated 2 years ago
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 6 months ago
- NOCaL: Calibration-Free Semi-Supervised Learning of Odometry and Camera Intrinsics☆10May 12, 2024Updated last year
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021)☆11Oct 24, 2021Updated 4 years ago
- Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the…☆13Aug 7, 2024Updated last year
- This is a project on visual spatial reasoning tasks-SIBench☆25Jan 12, 2026Updated last month
- ☆19Jan 16, 2026Updated last month
- [NeurIPS'25 Spotlight] This is the official codebase for the paper: STAR: A Benchmark for Astronomical Star Fields Super-Resolution☆15Oct 9, 2025Updated 4 months ago
- [ICCV 2025] RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes☆22Feb 10, 2026Updated 3 weeks ago
- Implementation of "Make One-Shot Video Object Segmentation Efficient Again" and the semi-supervised fine-tuning "e-OSVOS" approach (NeurI…☆36Mar 24, 2021Updated 4 years ago
- ☆77Aug 29, 2025Updated 6 months ago
- [AAAI 26 Demo] Official repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆64Jan 27, 2026Updated last month
- ☆20Oct 15, 2025Updated 4 months ago
- ☆15Jul 22, 2024Updated last year
- ☀️ [ArXiv 2025] Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting☆21Dec 10, 2025Updated 2 months ago
- JPush's officially supported PhoneGap/Cordova plugin (Android & iOS). The officially supported JPush PhoneGap/Cordova ionic2/3 Native plugin (Android & iOS). http:/…☆10Jul 9, 2017Updated 8 years ago
- ☆11Apr 10, 2024Updated last year
- LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussians☆22Jan 10, 2025Updated last year
- An unofficial re-implementation of Graph Structure of Neural Networks (Jiaxuan You · Kaiming He · Jure Leskovec · Saining Xie) ICML 2020☆10Jul 27, 2020Updated 5 years ago
- NightSurveillance Dataset for Pedestrian Detection☆11Jul 30, 2020Updated 5 years ago
- ☆11Aug 5, 2024Updated last year
- How to create, train and quantize network, then integrate it into pre/post image processing and generate CUDA C++ code for targeting Jets…☆12May 7, 2025Updated 9 months ago
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…☆22Jun 17, 2025Updated 8 months ago