huiwon-jang / RSP
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆23 Updated 11 months ago
Alternatives and similar repositories for RSP
Users that are interested in RSP are comparing it to the libraries listed below
- Code for Stable Control Representations ☆26 Updated 7 months ago
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models ☆115 Updated 2 years ago
- ☆60 Updated 2 years ago
- Official Code for Neural Systematic Binder ☆33 Updated 2 years ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023) ☆29 Updated 8 months ago
- ☆46 Updated last year
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning ☆20 Updated 8 months ago
- ☆86 Updated 3 months ago
- [WIP] Code for LangToMo ☆20 Updated 4 months ago
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R… ☆12 Updated last year
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization ☆51 Updated 2 years ago
- ☆45 Updated 2 years ago
- Personal Python toolbox ☆16 Updated last year
- Masked World Models for Visual Control ☆131 Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆57 Updated last year
- ☆11 Updated 2 years ago
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023) ☆126 Updated 2 years ago
- ☆84 Updated last year
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration ☆58 Updated 6 months ago
- ☆44 Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar… ☆67 Updated last year
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th… ☆84 Updated 7 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment ☆18 Updated 10 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆46 Updated 2 years ago
- The repository of ICML2023 paper: On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline ☆23 Updated 2 years ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ☆110 Updated 7 months ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆23 Updated 2 years ago
- ☆12 Updated 2 years ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities" ☆32 Updated 9 months ago
- Official code for Slot-Transformer for Videos (STEVE) ☆50 Updated 2 years ago