fredfyyang / Touch-and-Go
☆25Updated last year
Alternatives and similar repositories for Touch-and-Go:
Users who are interested in Touch-and-Go are comparing it to the repositories listed below:
- Binding Touch to Everything: Learning Unified Multimodal Tactile Representations☆28Updated last week
- ☆65Updated this week
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆122Updated last year
- This is the official repository of OCL (ICCV 2023).☆19Updated 10 months ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆53Updated 3 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 4 months ago
- ☆42Updated 2 months ago
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆24Updated 7 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆34Updated last year
- ☆42Updated 9 months ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆32Updated 9 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆113Updated 2 months ago
- [ECCV 2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆35Updated 3 months ago
- ☆65Updated 2 months ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆42Updated 6 months ago
- ☆11Updated last year
- ☆108Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆82Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆59Updated last year
- Latent Motion Token as the Bridging Language for Robot Manipulation☆71Updated last week
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆96Updated last year
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆54Updated 5 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆46Updated last month
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆41Updated 11 months ago
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆57Updated 2 years ago
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆50Updated 10 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆32Updated 3 weeks ago
- ☆23Updated 6 months ago
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"☆18Updated 7 months ago