MIT-MI / SmellNet
☆40Updated 2 months ago
Alternatives and similar repositories for SmellNet
Users that are interested in SmellNet are comparing it to the libraries listed below
- Code repository for IMU2CLIP (https://arxiv.org/pdf/2210.14395.pdf)☆96Updated 2 years ago
- ☆19Updated last year
- Codes, datasets, and synthetic dataset generator about the paper "LiCamPose: Combining Multi-View LiDAR and RGB Cameras for Robust Single…☆14Updated 8 months ago
- Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"☆217Updated 2 years ago
- ☆37Updated 7 months ago
- [CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"☆84Updated 2 years ago
- [ICML 2023] Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining☆153Updated last year
- ObjectFolder Dataset☆169Updated 3 years ago
- [CVPR 2023] Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning☆16Updated 2 years ago
- ☆149Updated 2 years ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆84Updated last year
- RoCoG-v2 (Robot Control Gestures) is a dataset intended to support the study of synthetic-to-real and ground-to-air video domain adaptati…☆16Updated last year
- [CVPR 2025] "DepthCues: Evaluating Monocular Depth Perception in Large Vision Models", Duolikun Danier, Mehmet Aygün, Changjian Li, Hakan…☆21Updated 10 months ago
- [AAAI 2024-Oral] EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder☆35Updated last year
- A comprehensive survey on Multimodal Models in 3D☆74Updated last year
- Official code release of "CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition"☆242Updated 2 years ago
- A curated list of egocentric (first-person) vision and related area resources☆305Updated last year
- Bidirectional Mapping between Action Physical-Semantic Space☆33Updated 4 months ago
- Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"☆94Updated 2 years ago
- Official code for NeurIPS 2023 SpotLight: VoxDet: Voxel Learning for Novel Instance Detection☆30Updated 2 years ago
- [CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies☆71Updated 5 months ago
- ☆28Updated last year
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆124Updated 2 years ago
- [ICLR 2024] This is the official code of the paper "V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection"☆133Updated last year
- ☆77Updated 2 years ago
- ☆27Updated last year
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆73Updated last year
- SceneFun3D ToolKit☆166Updated 9 months ago
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…☆311Updated last year
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆117Updated 8 months ago