huiwon-jang / RSP
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆24 Updated last year
Alternatives and similar repositories for RSP
Users who are interested in RSP compare it to the repositories listed below
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models ☆117 Updated 2 years ago
- Code for Stable Control Representations ☆26 Updated 8 months ago
- Official Code for Neural Systematic Binder ☆33 Updated 2 years ago
- ☆86 Updated 4 months ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities" ☆32 Updated 10 months ago
- ☆60 Updated 2 years ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023) ☆29 Updated 9 months ago
- ☆52 Updated last week
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning ☆22 Updated 9 months ago
- ☆11 Updated 2 years ago
- [WIP] Code for LangToMo ☆20 Updated 5 months ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted) ☆18 Updated 8 months ago
- ☆46 Updated last year
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization ☆51 Updated 2 years ago
- ☆23 Updated last month
- Official PyTorch implementation of AdaFlow ☆62 Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆45 Updated 2 years ago
- Masked World Models for Visual Control ☆131 Updated 2 years ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration ☆57 Updated 7 months ago
- ☆44 Updated last year
- ☆60 Updated last year
- PyTorch implementation of the Hiveformer research paper ☆49 Updated 2 years ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R… ☆12 Updated last year
- ☆46 Updated 2 years ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ☆111 Updated 8 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆58 Updated last year
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023) ☆129 Updated 2 years ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆23 Updated 2 years ago
- [MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling ☆22 Updated last year
- Official code for Slot-Transformer for Videos (STEVE) ☆50 Updated 2 years ago