emb-ai / octo-pytorchLinks
☆12Updated last month
Alternatives and similar repositories for octo-pytorch
Users that are interested in octo-pytorch are comparing it to the libraries listed below
Sorting:
- ☆26Updated 8 months ago
- ☆11Updated last month
- Depth map compression by colorization in vectorized form☆13Updated 10 months ago
- Code Release for "MaskTerial: A Foundation Model for Automated 2D Material Flake Detection"☆11Updated last month
- [ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation☆20Updated last month
- ☆15Updated 5 months ago
- 本项目主要是2025届浙江大学软件学院夏令营(AI营)的考核项目☆11Updated 6 months ago
- Calibrate both hand-in-eye and hand-to-eye simultaneously with colmap☆11Updated 10 months ago
- Code for the paper "ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions" published at CVPR 2025☆18Updated 5 months ago
- DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆167Updated last week
- [ICCV 2025] Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts☆22Updated 8 months ago
- ☆14Updated 10 months ago
- Official Implementation of Towards Open Vocabulary Video Semantic Segmentation☆12Updated 6 months ago
- unofficial☆11Updated 10 months ago
- ☆17Updated 5 months ago
- IsaacSim Extension for Dynamic Objects in Matterport3D Environments for AdaVLN research☆47Updated 5 months ago
- ☆11Updated 10 months ago
- ☆44Updated 3 months ago
- DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments☆21Updated 4 months ago
- This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.☆17Updated 9 months ago
- Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆138Updated last month
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation☆55Updated 5 months ago
- ☆81Updated 3 months ago
- [CVPR 2025] SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation☆24Updated last month
- Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"☆207Updated this week
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆43Updated 8 months ago
- official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"☆33Updated last year
- 基于selenium的SJTU体育场馆预约脚本☆12Updated 10 months ago
- Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25).☆26Updated 2 months ago
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆81Updated 3 months ago