A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds
☆29Jan 19, 2025Updated last year
Alternatives and similar repositories for spot-compose
Users that are interested in spot-compose are comparing it to the libraries listed below
Sorting:
- ☆15Jul 9, 2021Updated 4 years ago
- ☆17Nov 15, 2023Updated 2 years ago
- Lightweight Self-Supervised Monocular Depth Estimation Based on Transformer☆23Sep 24, 2024Updated last year
- Official Code for ShaSTA☆23Oct 23, 2023Updated 2 years ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆36Dec 2, 2025Updated 3 months ago
- [ECCV 2024] MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps☆25Mar 3, 2025Updated last year
- Official code for "4D-StOP: Panoptic Segmentation of 4D LiDAR using Spatio-temporal Object Proposal Generation and Aggregation"☆27Nov 15, 2022Updated 3 years ago
- Code for Stable Control Representations☆26Apr 5, 2025Updated 11 months ago
- [TCSVT‘24] SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation☆39May 27, 2025Updated 9 months ago
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.☆28Jan 24, 2024Updated 2 years ago
- Implementation of the RA-L2023 paper: Part-Guided 3D RL for Sim2Real Articulated Object Manipulation☆29Dec 27, 2024Updated last year
- The official implementation of "Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation" (CVPR 2024)☆28Jul 27, 2024Updated last year
- ☆31Jun 21, 2024Updated last year
- Official implementation of Image2Point.☆127Nov 16, 2022Updated 3 years ago
- Implementation of ECCV2022 paper - LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation☆34Nov 22, 2022Updated 3 years ago
- A Deepfake detector based on hybrid EfficientNet CNN and Vision Transformer archietcture. The model is explainable by rendering a heatma…☆15Mar 16, 2022Updated 3 years ago
- DetMatch: Two Teachers are Better Than One for Joint 2D and 3D Semi-Supervised Object Detection☆37Jun 9, 2023Updated 2 years ago
- ☆39Sep 30, 2023Updated 2 years ago
- [ECCV2022] Spike Transformer: Monocular Depth Estimation for Spiking Camera☆33Dec 17, 2024Updated last year
- IERG5350 Reinforcement Learning Course Project based on the Stanford AI lab's work on multimodal representation.☆31Apr 2, 2021Updated 4 years ago
- LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention (CVPR20)☆71Apr 2, 2020Updated 5 years ago
- Code of the all the data augmentation [ Based on our survey, that will soon be published ]☆10Jul 5, 2023Updated 2 years ago
- ☆16Feb 27, 2026Updated last week
- ☆12Apr 1, 2025Updated 11 months ago
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks☆12Jul 29, 2023Updated 2 years ago
- A virtual musical instrument built using Google MediaPipe.☆12Oct 10, 2022Updated 3 years ago
- TransientViT: A novel CNN - Vision Transformer hybrid real/bogus transient classifier for the Kilodegree Automatic Transient Survey☆10Nov 7, 2024Updated last year
- Implementation of a simple linear regression algorithm in MAMBA☆10Feb 12, 2020Updated 6 years ago
- This is the official GDSC repo with all of the source code presented in the video tutorials☆14Jun 27, 2023Updated 2 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆16Feb 15, 2023Updated 3 years ago
- Public repository for the 3DV 2024 spotlight paper "SALUDA: Surface-based Automotive Lidar Unsupervised Domain Adaptation".☆38Oct 30, 2024Updated last year
- [ECCV 2024] SDK for MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty☆43Feb 16, 2026Updated 2 weeks ago
- Official implementation for the paper "Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection"☆43Dec 14, 2024Updated last year
- Open Vocabulary Object Navigation☆117May 15, 2025Updated 9 months ago
- CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes☆12Sep 2, 2024Updated last year
- Tutorial for custom hand joint tracking on HoloLens 2☆13May 16, 2021Updated 4 years ago
- Track 5: Cross-Platform 3D Object Detection☆21Aug 16, 2025Updated 6 months ago
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11May 16, 2024Updated last year
- ROS wrapper of Nvidia Contact-graspnet model.☆17Jul 3, 2023Updated 2 years ago