ojh6404 / deep_vision_ros
ROS package for SOTA Computer Vision Models including SAM, Cutie, GroundingDINO, YOLO-World, VLPart, DEVA and MaskDINO.
☆48Updated last year
Alternatives and similar repositories for deep_vision_ros
Users who are interested in deep_vision_ros are comparing it to the libraries listed below
- Pytorch implementation for CtRNet-X☆34Updated 5 months ago
- ☆27Updated 2 years ago
- Long-term Human Trajectory Prediction using 3D DSGs☆35Updated 6 months ago
- Legged Open-Vocabulary Object Navigator☆22Updated last month
- Set of demos to try Isaac ROS with Isaac Sim☆42Updated 2 years ago
- [T-RO 2024] Official Software Development Kit for UT Campus Object Dataset (CODa)☆21Updated 8 months ago
- This project integrates autonomous forklift control using Isaac Sim and ROS, enabling real-time monitoring and interaction. The system au…☆17Updated 6 months ago
- SORT3D, an LLM-based object-centric grounding and indoor navigation system employing a spatial reasoning toolbox and state of the art 2D …☆55Updated last month
- [IROS 2024] [ICML 2024 Workshop Differentiable Almost Everything] MonoForce: Learnable Image-conditioned Physics Engine☆81Updated 2 months ago
- [CVPR Workshop 2025 - OpenSun3D] ForesightNav: Learning Scene Imagination for Efficient Exploration☆50Updated 4 months ago
- BehAV: Behavioral Rule Guided Autonomy Using VLM for Robot Navigation in Outdoor Scenes (ICRA'25)☆26Updated 10 months ago
- [CVPR 2025] CRISP: Object Pose and Shape Estimation with Test-Time Adaptation☆78Updated 5 months ago
- Performance benchmarking for NVIDIA-accelerated Isaac ROS packages☆25Updated last month
- ☆32Updated 6 months ago
- Evaluation of Visual Semantic Navigation Models in Real Robots☆22Updated 3 months ago
- Official code for the ECCV2024 paper "Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities"☆24Updated 10 months ago
- ☆127Updated 7 months ago
- ☆16Updated 5 months ago
- [ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation☆128Updated last year
- The ESMStereo models are designed with low computational complexity to achieve an acceptable balance between accuracy and speed, which ma…☆48Updated last month
- ☆20Updated 2 years ago
- Last-Mile Embodied Visual Navigation https://jbwasse2.github.io/portfolio/SLING/☆28Updated 2 years ago
- ☆25Updated 3 months ago
- Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration☆26Updated 10 months ago
- [IROS2024] STAIR: Semantic-Targeted Active Implicit Reconstruction☆17Updated last year
- An online open-vocabulary mapping system that enables natural language querying to navigate dynamic scenes, with support for ROS.☆68Updated this week
- ROS2 wrapper for depth anything☆13Updated last year
- TCC-IRoNL is a novel framework that leverages large language models (LLMs) and multi-modal vision-language models (VLMs) to enable ROS-ba…☆19Updated last month
- [CoRL2023] Open-Vocabulary Scene-Graph☆69Updated last year
- ☆15Updated 3 months ago