martius-lab / videosaurLinks
Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"
☆31Updated 8 months ago
Alternatives and similar repositories for videosaur
Users that are interested in videosaur are comparing it to the libraries listed below
Sorting:
- ☆86Updated 2 months ago
 - [NeurIPS 2023] Self-supervised Object-Centric Learning for Videos☆30Updated 11 months ago
 - ☆21Updated 3 months ago
 - (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning☆19Updated 7 months ago
 - One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)☆45Updated last year
 - LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆44Updated 2 years ago
 - Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆58Updated 9 months ago
 - Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)☆23Updated 11 months ago
 - [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆69Updated last year
 - This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆51Updated 2 years ago
 - ☆46Updated last year
 - [ICLR 2023 - UNOFFICIAL] Bridging the Gap to Real-World Object-Centric Learning☆19Updated last year
 - Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆64Updated 10 months ago
 - [IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds☆80Updated last year
 - Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆114Updated 2 years ago
 - Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆68Updated last year
 - [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆71Updated last year
 - [CVPR 2024] Binding Touch to Everything: Learning Unified Multimodal Tactile Representations☆66Updated 8 months ago
 - [CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation☆40Updated last year
 - ☆11Updated 2 years ago
 - Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆53Updated 4 months ago
 - Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆44Updated 4 months ago
 - ☆33Updated last year
 - ☆39Updated last year
 - [CoRL 2023 Oral] GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields☆136Updated last year
 - VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆29Updated 8 months ago
 - HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆40Updated last month
 - Implementation of Dream to Manipulate: Compositional World Models Empowering Robot Imitation Learning with Imagination☆32Updated 5 months ago
 - ☆59Updated 10 months ago
 - [ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu…☆91Updated 11 months ago