(CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
☆24 · Mar 11, 2025 · Updated last year
Alternatives and similar repositories for SlotMIM
Users that are interested in SlotMIM are comparing it to the libraries listed below.
- Official implementation of: "PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning" by… ☆22 · Apr 1, 2026 · Updated 2 months ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights ☆27 · Oct 28, 2024 · Updated last year
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024) ☆27 · Nov 27, 2024 · Updated last year
- ☆13 · Nov 1, 2023 · Updated 2 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping ☆97 · Mar 10, 2025 · Updated last year
- [ICCV 2025] 2D version of Dense Policy (DSP) ☆33 · Jan 14, 2026 · Updated 4 months ago
- List of papers on video-centric robot learning ☆23 · Nov 16, 2024 · Updated last year
- [RSS 2024] Learning Manipulation by Predicting Interaction ☆120 · Jul 2, 2025 · Updated 11 months ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities" ☆40 · Feb 12, 2025 · Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆48 · Oct 29, 2023 · Updated 2 years ago
- [ICML 2025] Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation ☆51 · Feb 3, 2026 · Updated 4 months ago
- [ICRA 2025] CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic Manipulation ☆36 · Jan 14, 2025 · Updated last year
- Official implementation of the ICML 2025 paper "SOLD: Slot Object-Centric Latent Dynamics Models for Relational Manipulation Learning fro… ☆20 · Sep 30, 2025 · Updated 8 months ago
- [CVPR 2025] RoboGround: Robotic Manipulation with Grounded Vision-Language Priors ☆47 · May 25, 2025 · Updated last year
- RL training scripts for learning an agent using ProcTHOR. ☆36 · Feb 18, 2025 · Updated last year
- [CVPR 2023] SGTAPose : Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence ☆19 · Jan 18, 2024 · Updated 2 years ago
- Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al… ☆25 · Oct 16, 2023 · Updated 2 years ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆24 · Aug 19, 2023 · Updated 2 years ago
- [ICLR 2025🎉] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar… ☆95 · Jan 22, 2025 · Updated last year
- ☆13 · May 28, 2025 · Updated last year
- [CoRL 2025] RISE-2: A Generalizable Imitation Learning Policy ☆61 · Nov 29, 2025 · Updated 6 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024] ☆113 · Nov 21, 2024 · Updated last year
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation ☆135 · Sep 8, 2025 · Updated 9 months ago
- Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight) ☆209 · Feb 28, 2024 · Updated 2 years ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult… ☆32 · Oct 26, 2022 · Updated 3 years ago
- YAICON 3rd project page - 4D Gaussian for Head Reconstruction ☆11 · Dec 22, 2023 · Updated 2 years ago
- ☆34 · May 14, 2025 · Updated last year
- A real-world autonomous driving simulator based on 3D Gaussian Splatting for scene augmentation ☆16 · Jun 10, 2024 · Updated last year
- Simple python rasterizer tool implemented by OpenGL and C++ ☆15 · Nov 10, 2025 · Updated 6 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆83 · Dec 12, 2024 · Updated last year
- CPU based on MIPS with 5-stage pipeline and cache, working with DDR2 memory and SD card. ☆32 · Sep 9, 2020 · Updated 5 years ago
- (ICCV 2025) "Principal Components" Enable A New Language of Images ☆85 · Updated this week
- ☆99 · Sep 4, 2024 · Updated last year
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261 ☆13 · Aug 22, 2021 · Updated 4 years ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆176 · Jun 19, 2025 · Updated 11 months ago
- Pytorch implementation of the Gato paper from Deepmind ☆12 · Feb 8, 2023 · Updated 3 years ago
- D-JEPA on ImageNet ☆22 · Nov 18, 2024 · Updated last year
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation ☆71 · Mar 27, 2026 · Updated 2 months ago
- ☆16 · Feb 13, 2025 · Updated last year