(CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
☆24Mar 11, 2025Updated last year
Alternatives and similar repositories for SlotMIM
Users that are interested in SlotMIM are comparing it to the libraries listed below
Sorting:
- Official implementation of: "PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning" by…☆17Jun 2, 2025Updated 9 months ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆27Oct 28, 2024Updated last year
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)☆26Nov 27, 2024Updated last year
- ☆13Nov 1, 2023Updated 2 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping☆97Mar 10, 2025Updated last year
- [ICCV 2025] 2D version of Dense Policy☆33Jan 14, 2026Updated 2 months ago
- List of papers on video-centric robot learning☆22Nov 16, 2024Updated last year
- [RSS 2024] Learning Manipulation by Predicting Interaction☆120Jul 2, 2025Updated 8 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆46Oct 29, 2023Updated 2 years ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"☆34Feb 12, 2025Updated last year
- ☆27Mar 6, 2025Updated last year
- [ICML 2025] Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation☆49Feb 3, 2026Updated last month
- [ICRA 2025] CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic Manipulation☆37Jan 14, 2025Updated last year
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆43May 25, 2025Updated 9 months ago
- RL training scripts for learning an agent using ProcTHOR.☆37Feb 18, 2025Updated last year
- [CVPR 2023] SGTAPose : Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence☆19Jan 18, 2024Updated 2 years ago
- Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al…☆23Oct 16, 2023Updated 2 years ago
- Official implementation for "Cluster-wise Graph Transformer with Dual-granularity Kernelized Attention" (NeurIPS2024 Spotlight)☆17Oct 10, 2024Updated last year
- [ICLR 2025🎉] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar…☆93Jan 22, 2025Updated last year
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me…☆24Aug 19, 2023Updated 2 years ago
- Converts MimicGen dataset into LeRobot format, to train and evaluate the ACT, BC, and diffusion policies☆23Nov 19, 2024Updated last year
- [CoRL 2025] RISE-2: A Generalizable Imitation Learning Policy☆60Nov 29, 2025Updated 3 months ago
- ☆14May 28, 2025Updated 9 months ago
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation☆133Sep 8, 2025Updated 6 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆108Nov 21, 2024Updated last year
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆32Oct 26, 2022Updated 3 years ago
- Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)☆200Feb 28, 2024Updated 2 years ago
- YAICON 3rd project page - 4D Gaussian for Head Reconstruction☆11Dec 22, 2023Updated 2 years ago
- ☆34May 14, 2025Updated 10 months ago
- A real-world autonomous driving simulator based on 3D Gaussian Splatting for scene augmentation☆16Jun 10, 2024Updated last year
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆79Dec 12, 2024Updated last year
- ☆96Sep 4, 2024Updated last year
- (ICCV 2025) "Principal Components" Enable A New Language of Images☆80Jul 28, 2025Updated 7 months ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Aug 22, 2021Updated 4 years ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆173Jun 19, 2025Updated 9 months ago
- Pytorch implementation of the Gato paper from Deepmind☆12Feb 8, 2023Updated 3 years ago
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆66Dec 22, 2025Updated 2 months ago
- ☆14Feb 13, 2025Updated last year
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆33Sep 30, 2025Updated 5 months ago