ROOT: VLM based System for Indoor Scene Understanding and Beyond
☆40Jan 22, 2025Updated last year
Alternatives and similar repositories for ROOT
Users that are interested in ROOT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neuroscience Inspired Agent Reasoning Framework☆30May 19, 2025Updated 10 months ago
- Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"☆67Nov 28, 2024Updated last year
- Introduce a novel Video Trimming (VT) task and proposes an agent-based approach (AVT) for detecting wasted footage, selecting valuable se…☆24Jan 20, 2025Updated last year
- Extreme Rotation Estimation using Dense Correlation Volumes☆44Jan 10, 2023Updated 3 years ago
- Benchmark codebase for 2D range finder based people detectors using the FROG dataset☆12Oct 20, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Example apps demonstrating the WRLD Android Java API to display stunning interactive 3D maps. https://www.wrld3d.com/developers/☆13Nov 3, 2022Updated 3 years ago
- A demo project of using ChatGPT to create Slate UI with TAPython in Unreal Engine 5. TAPython uses JSON for the user interface, which i…☆17Dec 30, 2023Updated 2 years ago
- Code for [AAAI 2026] AffordDex: Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors☆29Dec 26, 2025Updated 3 months ago
- Manipulating semantic data within Python☆19Jan 14, 2025Updated last year
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆18Jan 5, 2026Updated 3 months ago
- ☆35Jan 8, 2026Updated 3 months ago
- LVAS-Agent Code Base☆22Apr 15, 2025Updated last year
- ☆15Jun 14, 2025Updated 10 months ago
- [WACV 2025] Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge☆40Oct 29, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 吴恩达深度学习课程课后作业☆10Jan 28, 2020Updated 6 years ago
- [CVPR 2024] Shadows Don’t Lie and Lines Can’t Bend! Generative Models don’t know Projective Geometry...for now☆47Jun 19, 2024Updated last year
- Official implementation of the ECCV 2022 paper "CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillati…☆37Oct 5, 2022Updated 3 years ago
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆38Mar 21, 2025Updated last year
- Code for RA-L paper "One-shot Learning for Task-oriented Grasping"☆12May 9, 2024Updated last year
- ☆14Mar 23, 2024Updated 2 years ago
- ☆27Jun 28, 2022Updated 3 years ago
- Source code for our journal submission : ELD-Net: An efficient deep learning architecture for accurate saliency detection☆10Nov 27, 2017Updated 8 years ago
- [RA-L] DRAGON: A Dialogue-Based Robot for Assistive Navigation with Visual Language Grounding☆18Apr 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [IEEE RA-L & ICRA 2026] Semantic-Driven Voxel Representation for LiDAR–Inertial Odometry☆43Nov 20, 2025Updated 4 months ago
- LaTeX Template for Thesis and Reports☆14Dec 30, 2019Updated 6 years ago
- Pytorch implementation of deep fill v2 (original by Jiayu et al.)☆10Jun 26, 2019Updated 6 years ago
- The official repository for the paper "Statler: State-Maintaining Language Models for Embodied Reasoning"☆13Jun 10, 2024Updated last year
- Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation☆14Mar 31, 2026Updated 2 weeks ago
- ☆49Feb 12, 2026Updated 2 months ago
- ☆11Aug 29, 2025Updated 7 months ago
- [EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering☆17Oct 31, 2024Updated last year
- Pixel-ImageNet☆45Feb 24, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EKF, EIF and SEIF for SLAM☆10Nov 16, 2018Updated 7 years ago
- Hands-On Tutorial on Building Multimodal RAG Systems☆13Apr 10, 2025Updated last year
- Implementation of the Paper Scene-Graph ViT☆10Dec 20, 2024Updated last year
- ☆12Feb 18, 2014Updated 12 years ago
- Code || Q&A☆11Oct 25, 2018Updated 7 years ago
- With a subtle glass-like blur that melts into the background, Ayaka highlights your workspace without overwhelming it. Vibrant accents br…☆47Jan 2, 2026Updated 3 months ago
- super-resolution☆12Aug 2, 2019Updated 6 years ago