(CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
☆24Mar 11, 2025Updated last year
Alternatives and similar repositories for SlotMIM
Users that are interested in SlotMIM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of: "PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning" by…☆19Apr 1, 2026Updated last week
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆27Oct 28, 2024Updated last year
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)☆26Nov 27, 2024Updated last year
- ☆13Nov 1, 2023Updated 2 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping☆97Mar 10, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICCV 2025] 2D version of Dense Policy☆33Jan 14, 2026Updated 2 months ago
- List of papers on video-centric robot learning☆22Nov 16, 2024Updated last year
- [RSS 2024] Learning Manipulation by Predicting Interaction☆120Jul 2, 2025Updated 9 months ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"☆35Feb 12, 2025Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆47Oct 29, 2023Updated 2 years ago
- ☆27Mar 6, 2025Updated last year
- [ICML 2025] Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation☆49Feb 3, 2026Updated 2 months ago
- [ICRA 2025] CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic Manipulation☆37Jan 14, 2025Updated last year
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆43May 25, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- RL training scripts for learning an agent using ProcTHOR.☆36Feb 18, 2025Updated last year
- [CVPR 2023] SGTAPose : Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence☆19Jan 18, 2024Updated 2 years ago
- Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al…☆24Oct 16, 2023Updated 2 years ago
- [ICLR 2025🎉] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar…☆93Jan 22, 2025Updated last year
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me…☆24Aug 19, 2023Updated 2 years ago
- Converts MimicGen dataset into LeRobot format, to train and evaluate the ACT, BC, and diffusion policies☆24Nov 19, 2024Updated last year
- [CoRL 2025] RISE-2: A Generalizable Imitation Learning Policy☆60Nov 29, 2025Updated 4 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆108Nov 21, 2024Updated last year
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation☆132Sep 8, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)☆202Feb 28, 2024Updated 2 years ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆32Oct 26, 2022Updated 3 years ago
- YAICON 3rd project page - 4D Gaussian for Head Reconstruction☆11Dec 22, 2023Updated 2 years ago
- ☆34May 14, 2025Updated 10 months ago
- A real-world autonomous driving simulator based on 3D Gaussian Splatting for scene augmentation☆16Jun 10, 2024Updated last year
- Simple python rasterizer tool implemented by OpenGL and C++☆15Nov 10, 2025Updated 4 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆82Dec 12, 2024Updated last year
- CPU based on MIPS with 5-stage pipeline and cache, working with DDR2 memory and SD card.☆32Sep 9, 2020Updated 5 years ago
- (ICCV 2025) "Principal Components" Enable A New Language of Images☆81Jul 28, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆96Sep 4, 2024Updated last year
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Aug 22, 2021Updated 4 years ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆173Jun 19, 2025Updated 9 months ago
- Pytorch implementation of the Gato paper from Deepmind☆12Feb 8, 2023Updated 3 years ago
- ☆15Feb 13, 2025Updated last year
- Library for the training and evaluation of object-centric models (ICML 2022)☆71Apr 30, 2023Updated 2 years ago
- The PyTorch implementation of AlignSeg.☆21Feb 26, 2025Updated last year