wkentaro / osam
Get up and running with SAM, EfficientSAM, YOLO-World, and other promptable vision models locally.
☆49Updated 5 months ago
Alternatives and similar repositories for osam:
Users that are interested in osam are comparing it to the libraries listed below
- ☆58Updated 5 months ago
- Python scripts performing Metric Monocular Depth Estimation using the Unidepth model in ONNX.☆78Updated 6 months ago
- ROS package for SOTA Computer Vision Models including SAM, Cutie, GroundingDINO, YOLO-World, VLPart, DEVA and MaskDINO.☆42Updated 5 months ago
- Detect corn stalks for micro-sensor insertion☆13Updated 10 months ago
- ☆32Updated this week
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆40Updated 4 months ago
- ☆28Updated last year
- [ICRA2023] Demo of Mono-STAR☆31Updated 11 months ago
- 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.☆33Updated 3 weeks ago
- A diffusion model-based stereo depth estimation framework that can predict state-of-the-art depth and restore noisy depth maps for transp…☆46Updated 3 weeks ago
- A project for computing high-quality ground truth training examples for RGB-D data.☆43Updated last year
- Official code of PrimA6D☆44Updated last year
- Utilizing segment-anything to help the region selection of 3D point cloud or mesh.☆44Updated last year
- Webpage☆16Updated 11 months ago
- LLGS: Illuminating Gaussian Splatting via absorptance Modulation☆18Updated 3 months ago
- Uncertainty-Aware Rotation Estimation in Manhattan Environments using only monocular cues.☆64Updated 5 months ago
- ☆95Updated last year
- ☆17Updated 2 years ago
- Minimal code for Tapir model inference in Pytorch☆16Updated 5 months ago
- Code for "Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling" (CoRL 2024)☆94Updated last month
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.☆42Updated 3 months ago
- Integrates the vision, touch, and common-sense information of foundational models, customized to the agent's perceptual needs.☆30Updated this week
- This is the official release for the paper "EFM3D A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https//arx…☆128Updated 2 weeks ago
- ☆17Updated this week
- ☆124Updated 3 weeks ago
- Language instructions to mycobot using GPT-4V☆22Updated last year
- Source code for ZePHyR: Zero-shot Pose Hypothesis Rating @ ICRA 2021☆24Updated 2 years ago
- INS-Conv: Incremental Sparse Convolution for Online 3D Segmentation (CVPR 2022)☆59Updated 2 years ago
- [CVPR 2024] AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings☆58Updated 7 months ago
- Python package to create manipulation scenes.☆66Updated this week