huiwon-jang / RSP
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆18 Updated 5 months ago
Alternatives and similar repositories for RSP:
Users who are interested in RSP are comparing it to the repositories listed below:
- Code for Stable Control Representations ☆24 Updated last month
- Official Code for Neural Systematic Binder ☆32 Updated 2 years ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023) ☆27 Updated 2 months ago
- ☆56 Updated 2 years ago
- 📎 + 🦾 CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision ☆15 Updated 5 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆58 Updated 7 months ago
- Official code for Slot-Transformer for Videos (STEVE) ☆49 Updated 2 years ago
- ☆42 Updated last year
- Personal Python toolbox ☆16 Updated 9 months ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration ☆48 Updated this week
- ☆46 Updated 4 months ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R… ☆11 Updated last year
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization ☆50 Updated last year
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models ☆108 Updated last year
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment ☆14 Updated 3 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆85 Updated last year
- ☆44 Updated last year
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align… ☆17 Updated last year
- ☆41 Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆42 Updated last year
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning ☆12 Updated last month
- ☆23 Updated last year
- ☆69 Updated 8 months ago
- Masked World Models for Visual Control ☆122 Updated last year
- ☆22 Updated 2 years ago
- PyTorch implementation of RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling (MMM2025 Best Paper) ☆17 Updated 9 months ago
- [NeurIPS 2022] code for "Visual Concepts Tokenization" ☆21 Updated 2 years ago
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning ☆22 Updated 2 years ago
- The repository of ICML2023 paper: On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline ☆23 Updated last year
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆23 Updated last year