ROOT: VLM based System for Indoor Scene Understanding and Beyond
☆41Jan 22, 2025Updated last year
Alternatives and similar repositories for ROOT
Users that are interested in ROOT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neuroscience Inspired Agent Reasoning Framework☆31May 19, 2025Updated 11 months ago
- [ACM TOMM] Official implementation of "TextCoT: Zoom-In for Enhanced Multimodal Text-Rich Image Understanding"☆45Feb 27, 2026Updated 2 months ago
- Extreme Rotation Estimation using Dense Correlation Volumes☆44Jan 10, 2023Updated 3 years ago
- Embodied Instruction Following in Unknown Environments☆17Dec 8, 2025Updated 4 months ago
- SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning☆11May 24, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Benchmark codebase for 2D range finder based people detectors using the FROG dataset☆12Oct 20, 2025Updated 6 months ago
- A demo project of using ChatGPT to create Slate UI with TAPython in Unreal Engine 5. TAPython uses JSON for the user interface, which i…☆17Dec 30, 2023Updated 2 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- Manipulating semantic data within Python☆20Jan 14, 2025Updated last year
- Collections of object goal navigation papers in recent top-tier conferences.☆14Sep 24, 2022Updated 3 years ago
- LVAS-Agent Code Base☆20Apr 15, 2025Updated last year
- ☆15Jun 14, 2025Updated 10 months ago
- [CoRL 2024] Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models☆36Dec 7, 2024Updated last year
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆38Mar 21, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR 2024] Shadows Don’t Lie and Lines Can’t Bend! Generative Models don’t know Projective Geometry...for now☆49Jun 19, 2024Updated last year
- Code for RA-L paper "One-shot Learning for Task-oriented Grasping"☆12May 9, 2024Updated last year
- ☆14Mar 23, 2024Updated 2 years ago
- ☆27Jun 28, 2022Updated 3 years ago
- [RA-L] DRAGON: A Dialogue-Based Robot for Assistive Navigation with Visual Language Grounding☆18Apr 17, 2024Updated 2 years ago
- [IEEE RA-L & ICRA 2026] Semantic-Driven Voxel Representation for LiDAR–Inertial Odometry☆44Nov 20, 2025Updated 5 months ago
- CAR: Class-aware Regularizations for Semantic Segmentation (ECCV-2022)☆30Oct 26, 2022Updated 3 years ago
- RGB-D Camera Tracking Evaluation☆15Jan 17, 2020Updated 6 years ago
- Official repository for gathering data of Revisit Human-Scene Interaction via Space Occupancy (ECCV 2024).☆29Sep 29, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The official repository for the paper "Statler: State-Maintaining Language Models for Embodied Reasoning"☆13Jun 10, 2024Updated last year
- Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation☆14Mar 31, 2026Updated last month
- ☆12Aug 29, 2025Updated 8 months ago
- Pixel-ImageNet☆45Feb 24, 2022Updated 4 years ago
- FSD Tesla Open-source. Real-Time Environment Reconstruction System for Autonomous Vehicles☆20Jan 8, 2026Updated 3 months ago
- Hands-On Tutorial on Building Multimodal RAG Systems☆13Apr 10, 2025Updated last year
- Implementation of the Paper Scene-Graph ViT☆10Dec 20, 2024Updated last year
- Archives for Triton Inference Server Practices☆15Feb 28, 2022Updated 4 years ago
- ECCV 2022: Learning Shadow Correspondence for Video Shadow Detection☆14Jul 18, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Feb 18, 2014Updated 12 years ago
- super-resolution☆12Aug 2, 2019Updated 6 years ago
- ☆53Feb 12, 2026Updated 2 months ago
- ☆54Feb 5, 2026Updated 3 months ago
- Code for paper: "Few-Shot In-Context Imitation Learning via Implicit Graph Alignment"☆26Apr 5, 2024Updated 2 years ago
- Official repository of "Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion" (ACMMM 2024)☆15Oct 31, 2024Updated last year
- Code for RANet: Region Attention Network for Semantic Segmentation☆33May 26, 2021Updated 4 years ago