cfeng16 / UniTouch
Binding Touch to Everything: Learning Unified Multimodal Tactile Representations
☆26 Updated 10 months ago
Alternatives and similar repositories for UniTouch:
Users who are interested in UniTouch are comparing it to the repositories listed below.
- [ECCV 2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation … ☆35 Updated 2 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities ☆64 Updated 3 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration". ☆39 Updated 3 weeks ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆59 Updated 3 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023) ☆33 Updated last year
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆90 Updated 2 months ago
- Can 3D Vision-Language Models Truly Understand Natural Language? ☆21 Updated 9 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation ☆33 Updated last month
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024) ☆22 Updated 5 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆47 Updated 5 months ago
- Latent Motion Token as the Bridging Language for Robot Manipulation ☆66 Updated last month
- The code for paper "Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding". ☆41 Updated 3 weeks ago
- Egocentric Video Understanding Dataset (EVUD) ☆24 Updated 6 months ago
- Official code for MotionBench ☆22 Updated last week
- AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation ☆59 Updated 2 weeks ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation ☆76 Updated 6 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts. ☆58 Updated 3 months ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training" ☆32 Updated 8 months ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022 ☆51 Updated 2 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision" ☆24 Updated 9 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning ☆123 Updated last year
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024) ☆32 Updated 4 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆31 Updated last month
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024 ☆19 Updated last month