huiwon-jang / RSPLinks
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆23Updated 11 months ago
Alternatives and similar repositories for RSP
Users that are interested in RSP are comparing it to the libraries listed below
Sorting:
- Code for Stable Control Representations☆26Updated 6 months ago
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆114Updated 2 years ago
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning☆19Updated 7 months ago
- ☆60Updated 2 years ago
- [WIP] Code for LangToMo☆20Updated 4 months ago
- ☆85Updated 2 months ago
- Official Code for Neural Systematic Binder☆33Updated 2 years ago
- ☆46Updated last year
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆29Updated 7 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆107Updated 6 months ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"☆31Updated 8 months ago
- Masked World Models for Visual Control☆131Updated 2 years ago
- ☆45Updated last year
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆58Updated 5 months ago
- Implementation of Latent Diffusion Planning (Amber Xie, Oleh Rybkin, Dorsa Sadigh, Chelsea Finn)☆51Updated 4 months ago
- ☆44Updated last year
- Official code for Slot-Transformer for Videos (STEVE)☆50Updated 2 years ago
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)☆125Updated 2 years ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Updated last year
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆17Updated 6 months ago
- Official PyTorch implementation of AdaFlow☆59Updated 11 months ago
- ☆81Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆46Updated 2 years ago
- ☆59Updated 10 months ago
- ☆11Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated last year
- [CoRL 2025] Pretraining code for FLOWER VLA on OXE☆14Updated last month
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆67Updated last year
- ☆13Updated 5 months ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆83Updated 6 months ago